THE LOGARITHMIC CURVE OF DISTRIBUTION


By George R. Davies, University of North Dakota


Any one who has tabulated social and biologic data has observed 
that very commonly the distributions when plotted show a greater or 
less degree of positive skewness to which the conventional normal curve 
cannot easily be adjusted. He has perhaps also observed that when 
the frequencies are plotted on a logarithmic horizontal scale, they often 
appear to take the symmetrical form. This latter fact suggests that it 
may be possible to fit to such distributions a logarithmic normal¹ curve;
that is, a Gaussian curve in which the successive units (σ) of the horizontal
scale are readjusted to distances having a constant ratio rather
than a constant difference. Such a curve might be expressed by the
expansion of the binomial (1 + a)^n, in which the coefficients are taken
as the ordinates, and the powers of a represent their respective positions 
on the value scale. It is the purpose of this paper to analyze the 
logarithmic normal curve, and to show that it may be satisfactorily 
fitted to a wide variety of data. It has been thought best to put the 
analysis in a descriptive rather than a strictly mathematical form, 
since the normal curve is used by many who are specialists in other 
fields than statistics. 

In order to state the theory of the logarithmic normal curve, we may 
consider first the transformation of the skewed curve into a symmetrical 
normal; and secondly, the reverse of this transformation. As an 
example, we may take Curve A (Figure I) which shows a high degree 
of positive skewness. In order to transform this curve to a sym- 
metrical form we re-plot it on a logarithmic scale as indicated by curve 
B in the same figure. This apparently gives a normal curve, but it is 
not the required equivalent, since in changing the base we have dis- 
torted the area and the relative frequencies. Consequently it is 
necessary to make an adjustment to rectify this distortion. If we
conceive the original curve A to be composed of a series of narrow
rectangles (frequencies) extending from the base to the point in the
curve vertically above, we can see that the transformation has widened
these rectangular elements in the left portion of the curve and narrowed
them at the right. It will therefore be necessary to adjust the height of
each rectangle so as to preserve the same area as before. This may
obviously be done by finding the area of each rectangle on the original
base and dividing it by the width of the new base. The curve obtained as
the result of this adjustment in height is plotted as C. This curve has
its mode farther to the right, and is apparently symmetrical about the
mean, M. Its area apparently is the same as that of the original curve
A. In order, however, to obtain this equality of areas in the plotted
figures, it was necessary to use comparable scales. The desired result
was attained by drawing scales (A) and (C) so as to equate the mean
and twice the mean on the former, with the mean and 2.718 (that is, e)
times the mean on the latter, respectively.¹ Thus successive intervals
on the former scale are valued in arithmetic progression, and on the
latter scale in the geometric progression of logarithmic growth (e).
At the mean the intervals are valued equally. The precise effect of
this arrangement will appear later.

1 The term “logarithmic normal” is used because the type of distribution to which it applies is normal
in the usual sense if originally tabulated in classes having logarithmic limits.

FIGURE I

A, logarithmic normal curve; B, the same transferred to a logarithmic value scale;
and C, the same adjusted in height to maintain original area. M, geometric mean or
median of curve A and C. Arithmetic scale (A) and logarithmic scale (C) equate
M and 2M on the former with M and eM on the latter.
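
The height adjustment described above can be sketched numerically. The following Python fragment is only an illustration under stated assumptions: the frequencies are those of Table I later in the paper, the class limits are inferred from its class marks (5, 10, ...) and class interval of 5, and the function name `replot_on_log_scale` is hypothetical.

```python
import math

def replot_on_log_scale(limits, freqs):
    """Return (log-scale widths, adjusted heights) for each class.

    Each class is treated as a rectangle whose area is its frequency.
    On the logarithmic scale the width of a class becomes the difference
    of the natural logarithms of its limits, so the adjusted height is
    the original area divided by the new width.
    """
    widths = [math.log(hi) - math.log(lo)
              for lo, hi in zip(limits[:-1], limits[1:])]
    heights = [f / w for f, w in zip(freqs, widths)]
    return widths, heights

# Assumed class limits (inferred) and frequencies (Table I).
limits = [2.5, 7.5, 12.5, 17.5, 22.5, 27.5, 32.5, 37.5]
freqs = [27, 39, 22, 7, 3, 1, 1]

widths, heights = replot_on_log_scale(limits, freqs)

# Area of each rectangle on the new base; equals the original frequency.
areas = [w * h for w, h in zip(widths, heights)]
```

The widths shrink from left to right, which is the widening at the left and narrowing at the right that the text describes, while each rectangle keeps its original area.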

It should be noted at this point that the original curve A is best 
represented by a geometric mean or a median, both of which are 
theoretically identical with the mean (M) of curve C. It is also 
important to note that the standard deviation of curve C is paralleled 
by a ratio deviation in A, and that this ratio form of σ is the only
logical one to use in the latter case. The mean and the standard 
deviation may readily be computed by the usual method applied to 
the frequencies and the logarithms of the class marks. The frequencies 
should be taken as originally tabulated, since in this form they are 
proportional to the areas of the rectangular elements representing them. 
The mean and standard deviation thus obtained are the logarithms of 
the geometric mean and standard deviation ratio of curve A. The 
latter term we shall indicate in this paper by the symbol σ_r. In explanation
of this term it may be said that the data plotted give the
ratio of 1.649, which means that in curve A the successive values of 
the standard deviation units relative to the mean (10) are: 


     σ      Value          σ      Value
    −3       2.23          0      10.00 (M)
    −2       3.68          1      16.49
    −1       6.07          2      27.18
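
The values in the table above follow from multiplying the geometric mean by successive powers of the standard deviation ratio. A minimal sketch in Python follows; it assumes that σ_r is exactly e^0.5 (which rounds to the printed 1.649 and agrees with log σ_r = 0.21715 used later in the paper), and the helper name `value_at` is hypothetical.

```python
import math

M = 10.0                  # geometric mean (median) of curve A
SIGMA_R = math.exp(0.5)   # assumed exact value; printed as 1.649 in the paper

def value_at(k, m=M, sigma_r=SIGMA_R):
    """Value on the arithmetic scale lying k standard deviation units from M."""
    return m * sigma_r ** k

# Values at -3, -2, -1, 0, +1, +2 standard deviation units.
values = {k: round(value_at(k), 2) for k in range(-3, 3)}
```

Taking σ_r as the rounded figure 1.649 instead would change the last digit of some entries, which is why the unrounded ratio is assumed here.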


The first five of these points are plotted without designation on the 
upper edge of the base line, according to the arithmetic scale, in Figure 
I. If plotted at intervals represented by the ratio scale (C), they would 
be at equal distances (S) apart, as expressed in units of scale A by the 
equation, 

$$ S = \frac{\log \sigma_r}{\log e} = \frac{0.21715}{0.4343} = 0.5 $$

The area of a logarithmic normal curve similar to curve A, but expressed
in ordinary frequencies having a given class interval (I), may
therefore be expressed in standard deviation units, as in tables, by the
equation

$$ \text{Area} = \frac{\Sigma f \times I}{S} $$

in which Σf means the sum of the frequencies as tabulated.
1 In general, a point on one scale equating to a given point on the other scale may be found by the
relation, $A/M = \frac{\log (C/M)}{0.4343} + 1$, in which A and C are equated points on the scales (A) and (C)
respectively, and M is the geometric mean.

FIGURE II

Logarithmic normal curve, C, adjusted to double logarithmic scales, the horizontal
scale representing powers of the standard deviation ratio. B, the same curve with
height adjusted for transference to arithmetic value scale as in curve A of Figure I.
Oblique line, rn, equating points on standard deviation scale with corresponding
points on arithmetic scale; sm′, the same but referring to deviations about the mode.
For these lines, the vertical scale to the right is read times 1000.

We may look at the transformation of the symmetrical normal curve
into the logarithmic type more analytically if we begin with a curve
which forms a symmetrical parabola as plotted on double logarithmic
scales (curve C, Figure II). The parabola is of the second degree,
having its apex above the zero point of a standard deviation scale, and
represents the ordinates of a normal curve of unit area as given in the
conventional tables. The equation of the parabola is of the type

$$ y = a + bx + cx^2 $$

but in this case b = 0, and a and c have been given logarithmic values
corresponding to the ordinates of the normal curve of unit area, making
the equation,

$$ y = \log 0.3989 - 0.5(\log e)x^2 $$

or

$$ y = -1 + 0.60091 - 0.21715x^2 $$


It will be observed that the horizontal scale is plotted at arithmetic 
intervals, but since the standard deviation units as represented on that 
scale are taken to be powers of the standard deviation ratio, the scale is 
in effect of the ratio type. We may speak of the curve, therefore,
as drawn to a double logarithmic scale, although in fact, semi-logarithmic
paper was used. We have, obviously, in this curve the
equivalent of curve C of Figure I, except for the change in the vertical 
scale. 
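
As a check on the equation of curve C, the common logarithm of the unit normal ordinate can be compared with the parabola directly. The sketch below is illustrative only; the function names are hypothetical, and the constants are computed rather than copied so that they can be compared with the rounded figures in the text.

```python
import math

LOG_E = math.log10(math.e)                     # 0.4343 in the text
A = math.log10(1 / math.sqrt(2 * math.pi))     # log 0.3989 = -1 + 0.60091
C = -0.5 * LOG_E                               # -0.21715

def log_ordinate(x):
    """Curve C: the parabola y = a + c*x^2 (with b = 0), in common logarithms."""
    return A + C * x * x

def normal_ordinate(x):
    """Ordinate of the normal curve of unit area at x standard deviations."""
    return math.exp(-0.5 * x * x) / math.sqrt(2 * math.pi)

# Largest disagreement between antilog of the parabola and the normal ordinate.
max_error = max(abs(10 ** log_ordinate(x / 10) - normal_ordinate(x / 10))
                for x in range(-40, 41))
```

The disagreement is at the level of floating-point rounding, which confirms that the parabola is simply the normal curve read on a logarithmic vertical scale.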

If the curve C of Figure II is to be transformed into an equivalent 
logarithmic normal curve on arithmetic scales, such as curve A of the 
preceding figure, adjustments of the ordinates must obviously be made 
corresponding inversely to the adjustments for area previously de- 
scribed. The transformation to the new horizontal scale will progres- 
sively narrow the widths of the standard deviation rectangular elements 
to the left, and correspondingly lengthen those to the right. This 
change will require that the height of the curve be appropriately ex- 
tended in the first half and diminished in the second half. Such an 
adjustment may conveniently be made in Figure II, as represented by 
curve B, the construction of which will now be described. 

It may easily be shown that the arithmetic series designating the 
limits of rectangular elements on the base of curve C will constitute 
a geometric series when transferred to an arithmetic scale. Their 
first differences—namely, the widths of the successive rectangles— 
will therefore also constitute on the arithmetic scale a geometric series 
having the same rate of increase, and this series may be multiplied by 
a constant such that the central value at the point 0 is unity. This 
geometric series of first differences as thus modified constitutes the 
divisors used in adjusting the ordinates to the changed area. The 
required divisions, with the widths of the rectangles taken as infinitely 
small, may be graphically represented on the logarithmic scales by 
transferring curve C to the oblique base, BO (assumed to be extended 
entirely across the parabola) which is drawn at a slant represented 
negatively by the logarithm of the standard deviation ratio, 1.649. 
By combining this slant with the ordinates of curve C, we obtain curve 
B, the equation of which is,¹

$$ y = -1 + 0.60091 - 0.21715x - 0.21715x^2 $$


It is important to determine the position of the mode in curve B, 
since this position will give us the relation between the geometric mean 
(or the median) and the mode in the logarithmic curve as plotted on 
arithmetic scales. This position is determined by differentiating the 
equation of the curve expressed in its general algebraic form, and 
equating the slant thus obtained to zero. This procedure gives,²

$$ x = -\frac{b}{2c}, \quad \text{or} \quad -\frac{\log \sigma_r}{\log e} $$

which in terms of the data here used may be expressed

$$ -\frac{0.21715}{0.4343} = -0.5 $$

indicating that the mode is located above the point −0.5 (σ) on the
horizontal scale. By a simple algebraic process the height of the mode
of curve B, in logarithms, may be shown to be

$$ a - \frac{b^2}{4c}, \quad \text{or} \quad a + \frac{(\log \sigma_r)^2}{0.8686} $$

1 If the position of the mode on the σ scale is designated by m, the equation becomes,
y = log 0.3989 + m(log e)x − 0.5(log e)x²
2 As will appear later, the mode is located on the arithmetic scale at antilog (−b²/0.4343) times the
geometric mean.
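
The position and height of the mode of curve B can be verified numerically. This is a minimal sketch, assuming b = −log σ_r and c = −0.5 log e as in the equation of curve B, with log σ_r taken as exactly half of log e (the text's 0.21715); the variable names are illustrative.

```python
import math

LOG_E = math.log10(math.e)          # 0.4343 in the text
log_sigma_r = 0.5 * LOG_E           # assumed: 0.21715 in the text

a = math.log10(1 / math.sqrt(2 * math.pi))   # log 0.3989
b = -log_sigma_r
c = -0.5 * LOG_E

def curve_b(x):
    """Curve B: y = a + b*x + c*x^2, in common logarithms."""
    return a + b * x + c * x * x

# Setting the derivative b + 2*c*x to zero gives the mode.
mode_x = -b / (2 * c)
# Height of the mode, a - b^2/(4c), equivalently a + (log sigma_r)^2 / 0.8686.
mode_height = a - b * b / (4 * c)
```

With these values `mode_x` comes out at −0.5, and `curve_b(mode_x)` agrees with `mode_height`.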

On the basis of the foregoing calculations, it is now possible to demon- 
strate that the curve B obtained by adjusting the ordinates of curve 
C is of the same type as the latter, having been merely shifted to a dif- 
ferent position on both of the scales. This similarity may be inferred 
from the equation of the curve, and may also be shown precisely by 


expressing the general equation of the parabola,

$$ y = a + bx + cx^2 $$

in the form,

$$ y = a - \frac{b^2}{4c} + c\left(x + \frac{b}{2c}\right)^2 $$

the identity of which with the preceding equation is easily shown by
expanding the last term and simplifying. The latter equation indicates,
for negative values of b and c, a quadratic parabola similar to
C, but adjusted uniformly to a higher point on the vertical scale, and
shifted to the left on the horizontal scale. In transforming a symmetrical
normal such as is given in tables into a logarithmic normal, we
may therefore consider that the adjustment of the relative heights of the
ordinates is made when the position is suitably shifted to the left.
And the uniform adjustment of height may be made by multiplying
the normal ordinates by the factor, anti-log (−b²/4c), as previously
indicated.

1 Incidentally, this analysis is of use in the computation of growth trends such as the Gompertz curve.
Since the first differences of data conforming to a growth curve form a normal distribution, the apex of
this distribution as determined by fitting a logarithmic parabola locates the central date of the growth
curve. The Gompertz formula may then be applied, or the computation may be completed on the
basis of the logarithmic parabola of first differences.

It will be worth while, also, to determine the position of the arithmetic
mean of a logarithmic normal curve with respect to the horizontal
scales. This position may be determined by the following considerations.
As computed on the arithmetic base, the average would be the
sum of the frequencies times the values, divided by the sum of the
frequencies. On a logarithmic chart (Figure II), the former sum,
with the median taken as unity, would be represented by the integra- 
tion of a curve similar to B, but located to the right rather than to the 
left of the central point. The second sum, that is, the frequencies, is 
represented by the integration of the curve C. The quotient obtained 
by dividing the former sum by the latter is represented by the uniform 
difference between curve B and curve C as measured on the vertical 
scale. This difference has already been stated as −b²/4c, or b²/0.8686.
The position of the arithmetic mean on the arithmetic scale is therefore
the geometric mean times the anti-logarithm of b²/0.8686. In other
words, the arithmetic mean of the logarithmic normal curve diverges 
from the median in the same ratio that the ordinates increase uniformly 
in height over the ordinates of the corresponding symmetrical normal. 
On the standard deviation scale, the position of the arithmetic mean 
is located by dividing the factor just given by the logarithm of σ_r,
that is, by —b. This gives the position as b/4c. Comparing this posi- 
tion with that of the mode, we observe that it is at half the distance 
from the center, and in the opposite direction. This result confirms 
the usual observation that in moderately skewed curves the mode, 
median, and arithmetic mean are located in the order named, and at 
intervals having the ratio of 2:1. The ratio is stated exactly, however, 
if applied to the logarithmic intervals rather than to the intervals on the 
arithmetic scale. 
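
The relation among mode, median, and arithmetic mean stated above can be illustrated with the paper's own figures (M = 10, log σ_r = 0.21715). The sketch below is an illustration, not the author's computation; the function name `mode_median_mean` is hypothetical.

```python
import math

def mode_median_mean(m, log_sigma_r):
    """Mode, median, and arithmetic mean of a logarithmic normal curve.

    m is the geometric mean (median) and log_sigma_r the common logarithm
    of the standard deviation ratio, so that b^2 = log_sigma_r squared.
    """
    log_e = math.log10(math.e)                 # 0.4343
    b2 = log_sigma_r ** 2
    mode = m * 10 ** (-b2 / log_e)             # antilog(-b^2 / 0.4343)
    mean = m * 10 ** (b2 / (2 * log_e))        # antilog(b^2 / 0.8686)
    return mode, m, mean

mode, median, mean = mode_median_mean(10.0, 0.21715)

# Ratio of the logarithmic intervals mode-to-median and median-to-mean.
ratio = ((math.log10(median) - math.log10(mode))
         / (math.log10(mean) - math.log10(median)))
```

The ratio of the two logarithmic intervals is exactly 2, whereas on the arithmetic scale the corresponding intervals are only approximately in that ratio.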

In concluding our analysis, we may turn again to curves C and A of
Figure I. Assuming curve C as constant, and curve A as a variable
with respect to the degree of skewness, we see that the former represents
the limit of the latter at the lowest degree of skewness, where
σ_r is reduced almost to unity. In such a case the (A) and (C) horizontal
scales of that figure are both extended so that the zero point
(and M/e equated with it) is removed to a maximum distance to the
left. We may consider logarithmic curves of various degrees of skewness
as generated by a gradual increase of σ_r, which in turn contracts
uniformly the two scales (A) and (C), bringing the zero point constantly
closer to the center, M. As this change occurs, the arithmetic
mean shifts from the center to the right, while the mode rises at the
same rate and simultaneously approaches its limit, the zero point. On
passing the maximum of skewness (σ_r = ∞) the horizontal scales are
reversed, and negative skewness, having the same characteristics as
positive, is generated.¹


1 The degree of skewness (Sk) may be expressed as a coefficient by the formula, Sk = (σ_r − 1)/(σ_r + 1).
If, however, the quartile ratio (Q_r) is substituted for σ_r in this formula, the coefficient is identical with
that expressed by the usual formula Sk = (d₂ − d₁)/(d₂ + d₁), in which d₁ and d₂ are the differences
between the first and second quartiles, and the second and third quartiles, respectively. The relation of
Q_r to σ_r is expressed by, log Q_r = 0.67449 log σ_r.

We may now consider the methods by which the logarithmic normal
curve may be fitted to data. Let us assume the frequency distribution
given in Table I. The first step is to find the logarithms of the
geometric mean and the standard deviation ratio. These may be
calculated by substituting for the class marks their logarithms, and
computing the dispersion about the average in the usual way. The
average and standard deviation thus found are the logarithms required.
Since the computation is a familiar one, it has been omitted from the
table. The anti-logarithms of the logarithms thus found give the
geometric mean (M = 10), and the standard deviation ratio (σ_r = 1.649).

TABLE I
FITTING A NORMAL LOGARITHMIC CURVE TO A GIVEN DISTRIBUTION
(Geometric mean, 10; Standard deviation ratio, 1.649; Mode, 7.79.)

                 Class                Deviation (σ)   Ordinates   Ordinates
  Class          Mark    Frequency    from mode       (table)     adjusted
   2.5- 7.5        5        27          −0.89          0.2685       30.4
   7.5-12.5       10        39           0.50          0.3521       39.9
  12.5-17.5       15        22           1.31          0.1691       19.2
  17.5-22.5       20         7           1.89          0.0669        7.6
  22.5-27.5       25         3           2.33          0.0264        3.0
  27.5-32.5       30         1+          2.70          0.0104        1.2
  32.5-37.5       35         1−          3.01          0.0043        0.5
  Total                    100                         0.8977

1 With a small number of classes there is introduced an appreciable quadrature error, which may, if
necessary, be treated in accordance with the principles usually employed.
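
The computation omitted from the table can be sketched as follows. This is an illustration rather than the author's worksheet: the class marks and frequencies are those of Table I, the entries "1+" and "1−" are treated as 1 (an assumption), and the function name `log_moments` is hypothetical.

```python
import math

# Class marks and frequencies from Table I ("1+" and "1-" taken as 1).
class_marks = [5, 10, 15, 20, 25, 30, 35]
frequencies = [27, 39, 22, 7, 3, 1, 1]

def log_moments(marks, freqs):
    """Mean and standard deviation of the common logs of the class marks,
    weighted by the frequencies as originally tabulated."""
    n = sum(freqs)
    logs = [math.log10(v) for v in marks]
    mean_log = sum(f * x for f, x in zip(freqs, logs)) / n
    var_log = sum(f * (x - mean_log) ** 2 for f, x in zip(freqs, logs)) / n
    return mean_log, math.sqrt(var_log)

mean_log, sd_log = log_moments(class_marks, frequencies)

geometric_mean = 10 ** mean_log   # anti-logarithm of the mean of the logs
sigma_r = 10 ** sd_log            # anti-logarithm of the standard deviation
```

The results should come out close to the printed values (M = 10, σ_r = 1.649); small differences are to be expected from the treatment of the fractional frequencies and from the coarse grouping.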

The normal logarithmic curve may now be easily fitted to the data. 
As already explained, it should be made to center about the mode, 
since in so doing the ordinates are automatically adjusted to the dis- 
torted area of the logarithmic curve. The logarithm of the mode 
(Mo) is readily found by the equation, 


$$ \log Mo = \log M - \frac{(\log \sigma_r)^2}{0.4343} $$

The deviation (d_σ) of each class mark (V) from the mode, expressed
in units of the standard deviation, may be found by the usual procedure,
which is here stated by the formula,

$$ d_\sigma = \frac{\log V - \log Mo}{\log \sigma_r} $$

The deviations thus found are given in the fourth column of Table I. 
The ordinates corresponding to these deviations, as read from a table 
of the normal curve of unit area, are given in the succeeding column. 

As a final step it is necessary to adjust the ordinates of the unit
curve for area, and also for the increased height of the logarithmic as
compared with the symmetrical normal curve. The area is expressed
by the formula,

$$ \text{Area} = \Sigma f \times \frac{0.4343}{\log \sigma_r} \times \frac{C}{M} $$

The data of Table I give Σf = 100, the class interval (C) = 5, and the
geometric mean (M) = 10. The area is therefore 100. The additional
height factor for which adjustment must be made is the ratio
of the height of the logarithmic curve to its correlative symmetrical
curve, as expressed by the formula,

$$ \text{Height ratio} = \text{anti-log}\, \frac{(\log \sigma_r)^2}{0.8686} $$

This gives a ratio of 1.133. Combining the two factors of area and
height, we have 113.3 as the factor by which the ordinates of the curve
of unit area are to be adjusted. Using this multiplier we obtain the
adjusted ordinates given in the final column of Table I. If other
points in the fitted curve are required for accurate plotting, as at the
class limits, they may be readily found by the same procedure applied
to the deviations indicated.¹

1 Curve A, in Figure I, was plotted from the curve thus obtained.

If approximate results are sufficient, there are certain graphic short
cuts which may be substituted in part or in whole for the process just
described. One of these methods may be applied to the determination
of the mode and the deviations of the class marks from the mode,
as illustrated in Figure II. The procedure is as follows: draw an
oblique slant line, as rn, through the geometric mean at the center of
the standard deviation scale, having a slant indicated by the logarithms
of σ_r. The intersection of this line with the ordinate of the
mode (−log σ_r/0.4343 = −0.5), at n will give the value of the mode
(7.8), as read on the vertical logarithmic scale. It should be noted,
in reading this scale, that the decimal points are adjusted for the
parabolas, and should here be read “times 1000.” A second oblique
line, sm′, may now be drawn parallel to the first one and passing through
the mode at the center of the standard deviation scale (m′). This is
equivalent to shifting the standard deviation scale so that it centers
at the mode. By means of this oblique line the equivalent of any
value on the vertical logarithmic value scale may be read on the
standard deviation scale, and vice versa. The process is the graphic
equivalent of the formula by which the deviations of Table I were
located, and may be stated in more general form as

$$ s = \frac{\log (a/m)}{\log \sigma_r} $$

in which s is any required correlative point on the standard deviation
scale, a is the given correlative point on the value scale, and m is the
median, mode, or other point taken as the center.
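
The computational steps of Table I, from the mode through the adjusted ordinates, can be sketched in Python. This is an illustration under stated assumptions: the constants 0.4343 and 0.21715 are the rounded figures used in the text, deviations are rounded to two decimals before the ordinate is taken (as in a lookup from a printed table), and `statistics.NormalDist` stands in for the table of the normal curve.

```python
import math
from statistics import NormalDist

LOG_E = 0.4343            # log e, as rounded in the text
M = 10.0                  # geometric mean
LOG_SIGMA_R = 0.21715     # log of the standard deviation ratio 1.649
TOTAL_F = 100             # sum of the frequencies
CLASS_INTERVAL = 5        # class interval C
class_marks = [5, 10, 15, 20, 25, 30, 35]

# Mode: log Mo = log M - (log sigma_r)^2 / 0.4343
log_mode = math.log10(M) - LOG_SIGMA_R ** 2 / LOG_E
mode = 10 ** log_mode

# Area and height factors, combined into one multiplier.
area = TOTAL_F * (LOG_E / LOG_SIGMA_R) * (CLASS_INTERVAL / M)
height_ratio = 10 ** (LOG_SIGMA_R ** 2 / (2 * LOG_E))   # 2 * 0.4343 = 0.8686
factor = area * height_ratio

unit_normal = NormalDist()
rows = []
for v in class_marks:
    # Deviation from the mode in standard deviation units, two decimals.
    d = round((math.log10(v) - log_mode) / LOG_SIGMA_R, 2)
    ordinate = unit_normal.pdf(d)                # ordinate of the unit curve
    rows.append((v, d, round(ordinate, 4), round(factor * ordinate, 1)))
```

Each tuple in `rows` holds the class mark, the deviation, the tabular ordinate, and the adjusted ordinate, which may be compared with the last three columns of Table I. The same deviation formula with m set to the median instead of the mode gives the general relation for s stated above.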

A very convenient graphic method, covering practically the entire 
procedure of fitting the normal curve to data, may next be described. 
This method requires the use of logarithmic probability paper, such 
as may now be obtained from publishers. The use of this paper is 
illustrated in Figure III. The data relate to the distribution of 569 
farms on the basis of average yields during the years 1911 to 1921. 
FIGURE III


Distribution of certain farms according to yield per acre. Summation points,
beginning at lower limit of yield, plotted on logarithmic probability paper. For data,
see Mills, Statistical Methods, page 114. Straight line fitted to summation points
by inspection.


The frequencies as presented in the data are first reduced to percent- 
ages, and then written as a summation column (ogive), beginning at 
zero. This summation column is plotted on the logarithmic proba- 
bility paper against the upper limits of the respective classes. If the 
points thus plotted form a straight line, the curve is normal of the 
logarithmic type. In the illustration given, the distribution is quite 
irregular in the upper half, due, apparently, to irregular sampling.
This irregularity may be smoothed by fitting a straight line to the 
points as plotted. Such a line may be satisfactorily fitted by inspec- 
tion, but it should be kept in mind that deviations of the data from 
this line are greatly exaggerated at the upper and lower extremes. It 
is therefore important to make the line conform closely to the central 
points rather than to the extreme points. A practicable method for 
drawing this line is to locate it by reference to the quartile points, 
which in turn may readily be located by graphic interpolation. In 
some cases, however, it may be better to interpolate the first standard 
deviation points above and below the median. As a rule, however, 
the straight line may be easily drawn by inspection, since it usually 
conforms more closely to the data than in the case of the illustration 
here given. After the straight line has been drawn, the summations 
of the normal curve may be read from it, and the first differences may 
be taken as normal frequencies. The values of σ at +1 and 0 (Σ%:
84.1 and 50) may also be read from this line. The ratio of these two
values gives σ_r of the logarithmic normal curve as fitted, and the latter
of the two values gives the median. The fit of the curve to the data
may be seen in Figure IV. 
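
The reading of the median and σ_r from logarithmic probability paper has a simple numerical counterpart. The sketch below is not the graphic method itself: an unweighted least-squares line is substituted for the line drawn by inspection (so it does not favor the central points as the text advises), the function name `fit_log_probability_line` is hypothetical, and the check data are synthetic, generated from an exact logarithmic normal curve with M = 10 and σ_r = 1.649 rather than taken from the farm data.

```python
import math
from statistics import NormalDist

def fit_log_probability_line(upper_limits, cumulative_pct):
    """Fit z = alpha + beta * log10(limit) by least squares, where z is the
    normal deviate of the cumulative percentage, and read off the median
    (z = 0, 50 per cent) and sigma_r (ratio of values at z = +1 and z = 0)."""
    nd = NormalDist()
    pts = [(math.log10(x), nd.inv_cdf(p / 100.0))
           for x, p in zip(upper_limits, cumulative_pct) if 0.0 < p < 100.0]
    n = len(pts)
    mx = sum(x for x, _ in pts) / n
    mz = sum(z for _, z in pts) / n
    beta = (sum((x - mx) * (z - mz) for x, z in pts)
            / sum((x - mx) ** 2 for x, _ in pts))
    alpha = mz - beta * mx
    log_median = -alpha / beta               # log value where z = 0
    log_plus_one = (1.0 - alpha) / beta      # log value where z = +1
    return 10 ** log_median, 10 ** (log_plus_one - log_median)

# Synthetic check: cumulative percentages of an exact logarithmic normal curve.
true_m, true_sr = 10.0, 1.649
limits = [7.5, 12.5, 17.5, 22.5, 27.5]
pct = [100 * NormalDist().cdf(math.log10(x / true_m) / math.log10(true_sr))
       for x in limits]

median, sigma_r = fit_log_probability_line(limits, pct)
```

On exact data the points lie on a straight line and the fit recovers M and σ_r; with real tabulations a weighted fit, or one restricted to the central points, would be closer in spirit to the procedure described.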








FIGURE IV

Frequency distribution of certain farms according to the yield per acre, and
logarithmic normal curve (smoothed line) fitted to same. Data as in preceding
figure, and normal taken as first differences of summation line as drawn in that figure.
(Horizontal axis: bushels per acre. Vertical axis: farms, per cent.)



The logarithmic normal curve is a type very commonly met with in 
statistical work. As a rule it is illustrated in series which are definitely 
limited at the zero point or at some other fixed point, but in this 
respect there does not appear to be any precise mathematical rule. 
Thus the distribution of income in the United States in 1918, as 
estimated by the National Bureau of Economic Research, includes a
few negative values, but it nevertheless conforms rather closely to the
logarithmic type except at the extremes. Again, discrete series in
many, if not all, cases follow the symmetrical normal rather than the 
logarithmic type. Still other series, such as some measurements of 
stature, fall at a position on the horizontal scale where the logarithmic 
and symmetrical curves are almost indistinguishable, and it is there- 
fore impossible to tell objectively to which type they belong. Occa- 
sionally, also, distributions will be found in which there is a degree of 
regular skewness which does not exactly conform to the logarithms of 
the value scale. In such a case it is comparatively easy to fit a skewed 
curve of the type described merely by adding a constant to the suc- 
cessive items of the value scale sufficient to bring the percentage 
summations on a logarithmic probability chart to approximately a 
straight line. In testing a considerable number of distributions, however,
covering such diverse subjects as mental tests (see Figure V),
incomes, housing conditions, mortality rates, pulse beats, etc., the
writer has found very few in which the logarithmic normal curve,
fitted as previously described, failed to give a satisfactory fit.

FIGURE V

Frequency distribution of mental age of fourth grade pupils in Illinois, with logarithmic
normal curve (smoothed line) fitted to data. (Horizontal axis: mental age, years.)

The type of distribution exhibited in one or two special cases may 
be considered briefly. One of these cases is the dispersion of arrays 
of prices expressed as index numbers having a common base period. 
It has been questioned whether such dispersions conform to any 
regular law. Professor Kelley regards them as probably conforming 
more or less closely to one of the complex curves analyzed by Pearson. 
The data which he gives in illustration are index numbers of a specified 
year based on the preceding year. For such data he is apparently 
correct, but a comparison of similar distributions having a more remote 
base indicates a different law. Apparently the scattering of prices is 
a gradual process involving in the first year or two only certain un- 
stable items, while a considerable number of items remain close to the 
established average. But after the forces producing change have
come generally into play, the logarithmic normal dispersion is approx- 
imated. At least, such a result has been obtained in the few cases 
that have been tested. 

A dispersion approximating the logarithmic normal type with neg- 
ative skewness is illustrated in percentage scholarship grades. The 
logarithmic normal curve may readily be fitted to such data by reading 
the value scale in units of positive deviation from the 100 per cent 
mark. The result of such a computation applied to data cited by 
Professor Kelley (Statistical Methods, page 29) is shown in Figure VI. 
FIGURE VI
Frequency distribution of scholarship grades as quoted by Kelley in Statistical
Methods, page 29, and logarithmic normal curve (smoothed line) fitted to data.


It would, perhaps, have been better to have readjusted these data to
wider class intervals, but even as they stand they indicate the type of
curve. In this case a secondary limiting point occurs at zero, but it
has no appreciable effect on the distribution. It may be of interest
to note that if this illustration is typical, letter grades should be
transformed to numerical grades on the basis of such a curve. With a
passing grade of 70 per cent, and a median close to 85 per cent, a five
class distribution (combining E and F as one class) would range
approximately as follows:

    Letters    Per cents
       A        100-93
       B         92-89
       C         88-82
       D         81-70
       E         69-60
       F         59- 0


Summarizing, we may say that a relatively large number of the
distributions tabulated in statistical work form skewed curves
approximating the Gaussian type when re-tabulated or re-drawn to a
logarithmic value scale. Such distributions should be measured about
the geometric mean (or median) which is theoretically identical with
the mode of the revised symmetrical figure. Deviations should be
measured in units of the standard deviation ratio, which is the
anti-logarithm of the standard deviation as computed from the logarithms
of the class marks. The positions of the mode and arithmetic mean
of the skewed curve are functions of the standard deviation ratio.
Logarithmic normal curves may be fitted to these skewed curves either
by computation or by graphic methods. Such curve-fitting may
appropriately be used in many fields of statistical work as a means of
checking the type of distribution and the adequacy of a given sample.
















SIZE OF FAMILIES FROM WHICH COLLEGE 
STUDENTS COME 


By Warren S. Thompson


This study adds another to a number already published in which an 
attempt is made to ascertain the number of children born to parents in a 
particular group. Our group consists of people who send their children 
to college and our data come chiefly from the Middle West and the 
South. A special effort was made to have these two sections of the 
nation well represented because hitherto most of the data used in 
similar studies have come from the Northeast.¹

The information for this study was secured chiefly by teachers of 
college sociology from their classes. The author wishes to express here 
his great appreciation of the cordial cooperation of these teachers and
also to absolve them from any responsibility for the use he has made 
of the data. 

Every effort has been made to eliminate error in the basic data, but 
on the face of things it appears that there are two errors of some im- 
portance: one cannot in any way be corrected, the other we have 
attempted to correct by going over the heads of a group of students 
to their parents. The first error consists in the tendency of students to 
list their fathers’ occupations in such a way that they appear to belong 
to an occupational group of higher social status than they actually do.
This will probably account for the small numbers in the skilled, the 
unskilled, and clerical groups. The second error is in the understate- 
ment by the students of the number of children in both generations. 
In order to check up on this understatement, schedules filled out by 
about 400 Miami students showing the number of their own brothers 
and sisters and of their fathers’ and mothers’ brothers and sisters were 
compared with those filled out by their parents showing the number of 
children born to them and also the number of their brothers and sisters. 
It was found that the students’ answers understated the number of 
their own brothers and sisters a little less than one-tenth of a child while 
they understated the number of children in their parents’ generation by 
five-tenths of a child. In using the data for the number of children in 
the students’ generation no correction has been made because of this 
small error. In using the students’ data regarding the parents’ generation,
however, five-tenths has been added in the discussion (not in the
tables).

1 Two studies, one relating chiefly to the Middle West and one to the Far West, have recently been
published: (a) R. E. Baber and E. A. Ross, No. 10, University of Wisconsin Studies in the Social Sciences
and History, Madison, 1924, and (b) S. J. Holmes, “Size of College Students’ Families (California),”
Journal of Heredity, October 1924.

The basic data for the two generations are given in Tables I and II. 
Several things stand out clearly upon examination of these tables. 
In the first place a comparison of the number of children in the families 
from which the students come (hereafter called students’ families) 
with that of the families from which their parents come (hereafter called 
parents’ families) shows that there has been a marked decline. Using 
the corrected average (obtained by adding .5 to the average given here, 
for the reasons stated above) for the parents’ families we find that the 
decline has been from 6.51 to 4.32 or 33.6 per cent. This is about 5 
per cent less than Baber and Ross¹ found in their study of native families
(38.42 per cent) but somewhat more than Holmes² found in his
study of California students (29.8 per cent). If, however, we compare 
the averages in our study for the East North Central states with those 
of Baber and Ross whose data were gathered chiefly from those states 
we get a decline of 39.3 per cent which is only slightly in excess of theirs. 
When the comparison between the Pacific states in our study and 
Holmes’ data is made the decline we find is 38.5 per cent which is still 
greater than in the general comparison made above. This is probably 
due chiefly to the fact that our data for this section are inadequate. 
It is interesting to note that when compared with these other studies 
our absolute figures are higher in both cases. Baber and Ross found 
3.35 the average number of children in students’ families, Holmes 3.66 
and we found 4.32. These differences are undoubtedly due to the fact 
that the data were gathered on a different basis in each case. Baber 
and Ross investigated native families, chiefly in the north Middle 
West; Holmes studied University of California students born in all 
parts of the United States; and our data are heavily weighted by the 
Middle West and South and include foreign as well as native families. 
The parents’ families in all of these studies were filled chiefly during 
the 15 years following the Civil War and the students’ families during 
the decade 1895-1905. Thus it would appear that each year during 
this interval has seen about 1 per cent decline in the number of the 
children born to the groups represented here.
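
The arithmetic behind the general decline can be checked with a short sketch. The two averages are the ones quoted in the text; the interval of about 28 years between the midpoints of the two periods is an assumption made here for illustration, not a figure given by the author.

```python
def percent_decline(before: float, after: float) -> float:
    """Percentage fall from `before` to `after`."""
    return (before - after) / before * 100.0


parents_avg = 6.51   # corrected average for parents' families (from the text)
students_avg = 4.32  # average for students' families (from the text)

decline = percent_decline(parents_avg, students_avg)

# ASSUMPTION: roughly 28 years lie between the midpoint of the 15 years
# following the Civil War and the midpoint of the decade 1895-1905.
ASSUMED_YEARS = 28
per_year = decline / ASSUMED_YEARS

print(f"total decline: {decline:.1f} per cent")
print(f"simple average per year: {per_year:.1f} per cent")
```

With any interval of 25 to 30 years the simple yearly average stays a little above 1 per cent, which agrees with the rough statement in the text.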


1 Op. cit. 2 Loc. cit.


COMPARISON OF OCCUPATIONS


The decline in the number of children by occupational groups for the
whole country has by no means been uniform. In agriculture it has
been 20 per cent, in the professions 34.2, in trade 34.3, in managerial
occupations 37.4, in clerical work 36.8, in skilled occupations 34.8, and
in unskilled 30.2. Baber and Ross found the percentages 29, 42, 39,
39,¹ 38, 34, and 44 respectively. These percentages are not strictly
comparable because of the different basis on which the material was 
collected. They are more nearly comparable if we use only the data 
from the East North Central states. These percentages are 30.3, 
37.4, 37.3, 40.3, 35.2, 37.6, and 25.2 for the occupational groups in the 
order given above, and more nearly correspond to those of Baber and 
Ross.²
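
One way to see that the East North Central figures lie closer to those of Baber and Ross is to average the absolute gaps, group by group. The sketch below uses only the percentages quoted above; the mean absolute gap is a summary chosen here for illustration and is not a measure used by the author.

```python
GROUPS = ["agriculture", "professions", "trade", "managerial",
          "clerical", "skilled", "unskilled"]

# Percentage declines quoted in the text, in the order of GROUPS.
whole_country = [20.0, 34.2, 34.3, 37.4, 36.8, 34.8, 30.2]
east_north_central = [30.3, 37.4, 37.3, 40.3, 35.2, 37.6, 25.2]
baber_ross = [29, 42, 39, 39, 38, 34, 44]

# Groups for which the footnote calls the data quite inadequate.
THIN = ("clerical", "unskilled")


def mean_abs_gap(ours, theirs, skip=()):
    """Average absolute difference, optionally leaving some groups out."""
    gaps = [abs(o - t)
            for g, o, t in zip(GROUPS, ours, theirs)
            if g not in skip]
    return sum(gaps) / len(gaps)


for label, series in [("whole country", whole_country),
                      ("East North Central", east_north_central)]:
    all_groups = mean_abs_gap(series, baber_ross)
    adequate = mean_abs_gap(series, baber_ross, skip=THIN)
    print(f"{label}: {all_groups:.2f} (all groups), "
          f"{adequate:.2f} (clerical and unskilled omitted)")
```

On either basis the East North Central series shows the smaller average gap, and the difference is more marked when the two thinly represented groups are omitted.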

Our study shows definitely that in every case where the data are 
adequate the largest number of children is found in the families where 
the fathers are engaged in agriculture. This is true in both genera- 
tions and for all sections of the country. Clearly, up to the time when 
the present college students were being born, farm families were feeling 
the pressure of the forces tending to reduce the birth-rate less than 
families living and working in urban communities. There can be little 
doubt that this is as true today as it was 20 years ago. In fact the 
percentages given above indicate that these forces were becoming 
relatively more intense in the cities during the interval between the 
birth of the parents and of the present college students. The trend is 
probably still in the same direction. 

In the parents’ generation the difference between the farmers’ families 
and the professional men’s families was 1.05 of a child, in the students’ 
generation it was 1.69, in the other occupations they were respectively, 
trade 1.30 and 1.85, managerial 1.04 and 1.85, clerical 1.60 and 2.17, 
skilled .74 and 1.50, and unskilled .35 and .91. These facts reënforce
the conclusion expressed just above. There is some reason to believe, 
moreover, that the more prosperous farmers presumably with the 
smaller families are the ones who send their children to college while 
the poorer farm owners, tenants, and laborers with larger families 
would seldom send their children to college. This would also be true 
to a certain extent of the occupational groups living chiefly in the cities 
but it seems probable that the families sampled among professional, 
trade and managerial workers are a more typical cross section of these 
groups than the families sampled in the farming group. Among the 
skilled and clerical workers again (although our data are too meager 
to be of much value) the situation would probably be much the same as 
among farmers, so that the averages given here for these groups prob- 
ably tend to understate the number of children in the families of such 
workers taken as a whole. 


1 This assumes that their group called “business” is practically the same as our groups “trade” and
“managerial.”

2 It should be noted that the data on clerical and unskilled occupations are quite inadequate in our
study. For this reason percentages based upon them have almost no value.

It must of course be remembered that our data apply only to fertile
families. The average number of children born to all married couples 
in these groups would be considerably smaller. Such information as 
we possess¹ indicates that childlessness is 2 to 3 times as common
among city people as among rural dwellers which is just another proof 
that the causes of the decline in the birth-rate bear more heavily upon 
city dwellers than upon people living in the country. Clearly such 
differences cannot be ignored as they sometimes are when an attempt is 
made to get at the sources of natural increase in the nation. 


COMPARISON OF SIZE OF FAMILIES BY REGIONS


For some parts of this comparison four groups of states have been
used: the Northeast, including New England and the Middle Atlantic
states; the Middle West, including the East North Central and West
North Central states; the South, including the South Atlantic, the
East South Central, and the West South Central states; and the West,
including the Mountain and Pacific states.

Unfortunately our data are not as adequate for the first and last of 
these regions as for the other two. However, such data as we have for 
the Northeast and the West agree fairly well with data gathered by 
other people in other studies, when due allowance is made for the fact 
that our data include foreign as well as native families and that they 
were gathered largely from Middle West institutions, thus representing 
a rather large proportion of migrant families from these regions. 

As our data for the students’ families stand, the South has a clear 
lead over other regions. The average number of children per family is 
44.1 per cent greater in the South than in the Northeast, 25.7 per cent 
greater than in the Middle West and 24.2 per cent greater than in the 
West. The South is both native and rural. Thus it appears that it is 
not so much the nativity of people that determines the size of the family 
as it is the degree to which industrialization has taken place. The 
differences between the East North Central and the West North Central
states point in the same direction, the part of the Middle West
west of the Mississippi River having an average 14 per cent higher than
that east of it. The Middle West east of the Mississippi River is
rapidly approaching the same general condition that prevails in the
Northeast as regards the size of family. We know from census data, of
course, that urbanization and industrialization are proceeding rapidly
in this area.

1 See Report of Immigration Commission, “Fecundity of Immigrant Women.” In Cleveland 15.2
per cent of native white women of native parentage married 10-19 years had borne no children; in rural
Ohio only 5.7 per cent of the women in the same group were childless. For immigrant women the
percentages are, first generation, Cleveland 5.5, rural counties 4.6; second generation, 8.5 and 5.5.
In Minneapolis 12.7 per cent of the native women were childless; in the rural counties of Minnesota
only 5.1. For the first generation of immigrant women in Minneapolis the percentage was 6.5, in the
rural counties of Minnesota 2.6; second generation Minneapolis 7.9, rural counties 2.8.

2 The best general discussion of the causes of sterility in marriage is to be found in E. Reynolds and
D. Macomber, Fertility and Sterility in Human Marriages, 1924, W. B. Saunders Co., Philadelphia.
This discussion certainly leaves the impression that many of the involuntary causes of sterility are a
consequence of urban conditions of living and would not arise under the simpler conditions of rural life.
Elsewhere the author has discussed what he believes to be the most significant differences between urban
life and rural life in this connection. See Scientific Monthly, Vol. 5, Nos. 1, 2, 3, July, August and
September 1917, and American Journal of Physical Anthropology, Vol. 3, No. 1, January 1920.

23] Size of Families from Which College Students Come 489

It is interesting to note, too, that Pennsylvania, one of the most 
highly industrialized states, seems to belong with that part of the 
Middle West west of the Mississippi rather than with the Northeast in 
the matter of the size of family. The data here are, of course, in- 
sufficient to prove this definitely and could be ignored if they stood 
alone. When, however, we approach the problem from other angles, 
and find the facts confirmatory, there seems good reason to suppose 
that in some respects Pennsylvania is different from its neighbor 
states. Thus we find that Pennsylvania had (1920) 457 children under 
5 years of age per 1000 women 15-44 years of age in its urban com- 
munities; in New York the number was 386, and in Ohio 403. In the 
rural communities the figures were: Pennsylvania 621, New York 447, 
Ohio 526. In other comparisons of a somewhat similar nature Penn- 
sylvania has from 8 per cent to 12 per cent more children than Ohio 
and an even larger percentage of excess when compared with New York. 
We also find that Pennsylvania has consistently had a somewhat higher 
birth-rate than its neighboring states, running as much as 25 per cent 
higher for some of the years since 1918, while the average excess would 
be between 10 and 15 per cent. 
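
The 1920 child-woman ratios quoted above can be turned into percentage excesses with a few lines. The figures are those given in the text; the helper `excess_pct` is simply a name introduced here for the sketch.

```python
# Children under 5 per 1000 women aged 15-44 (1920), as quoted in the text.
ratios = {
    "urban": {"Pennsylvania": 457, "New York": 386, "Ohio": 403},
    "rural": {"Pennsylvania": 621, "New York": 447, "Ohio": 526},
}


def excess_pct(a: float, b: float) -> float:
    """Percentage by which `a` exceeds `b`."""
    return (a - b) / b * 100.0


for setting, by_state in ratios.items():
    pa = by_state["Pennsylvania"]
    for state in ("New York", "Ohio"):
        pct = excess_pct(pa, by_state[state])
        print(f"{setting}: Pennsylvania exceeds {state} by {pct:.1f} per cent")
```

In both urban and rural communities Pennsylvania stands above its two neighbors, and the excess over New York is larger than the excess over Ohio.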

I do not know how to account for Pennsylvania’s larger families ex- 
cept on the basis of its large mining population (miners always seem to 
have families of more than average size) and the character of its foreign-born
population which contains a very large proportion of Slavs and
Italians. Neither of these factors, however, should exercise much 
influence on the families of college students; for these groups send very 
few of their children to college. 


REPRODUCTION IN THE DIFFERENT GROUPS AND REGIONS 


Dr. Dublin has recently shown¹ that it takes 3.1 children per fertile
family to keep the population stationary. On this basis it might seem 
that the people who have children in college are more than reproducing 
their class in all sections of the country and in practically all occupa- 
tional groups. Before we can come to such a conclusion, however, 
there are several facts we must take into account. In the first place 

1 L. I. Dublin, “The Statistician and the Population Problem,” this JOURNAL, March 1925, p. 8.

Dr. Dublin’s estimate is for the present, not 20 years ago when the
present college students were being born. No doubt 3.6 to 3.7 would be 
nearer the figure for that time. In the second place, it is a figure for the 
population as a whole and is too low for this class, because marriages 
are fewer and childless marriages are more numerous in the class of 
people who send children to college than in the general population. 
For these reasons the fertile families in this class probably needed 3.9 
to 4 children in order to maintain the numbers in the class as a whole. 
Now if we examine Table III which summarizes the averages by sec- 
tions and occupations, we shall be able to tell very easily where this 


TABLE III. THE AVERAGE SIZE OF FAMILIES OF COLLEGE STUDENTS

                      Agri.   Prof.   Trade   Manag.  Skilled  Clerical  Total

New England           4.66    2.97    3.59    3.39    5.81     -         35
Middle Atlantic       4.22    5.40    5.51    5.38    5.69     3.44      55
The Northeast         4.51    3.29    5.55    3.58    5.71     5.51      305

East North Central    4.66    3.48    3.47    3.62    4.04     3.95      3.86
West North Central    5.34    4.08    3.71    3.43    3.89     3.42      4.40
The Middle West       5.03    3.70    3.57    3.51    3.99     3.36      4.08

South Atlantic        6.61    4.46    4.47    4.56    4.95     3.93      5.51
East South Central    6.15    4.66    4.14    4.04    4.29     2.78      5.05
West South Central    6.09    4.72    3.91    4.50    4.42     3.42      5.05
The South             6.26    4.59    4.13    4.38    4.50     3.42      5.13

Mountain              5.72    3.12    3.47    3.59    3.79     Sede      4.09
Pacific               5.64    3.87    Sece    3.00    3.13     3.50      3.94
The West              5.70    Seol    3.44    3.47    3.72     3.56      4.06

United States         5.55    3.86    3.70    3.70    4.04     Seo?      4.32


class is increasing, where decreasing, and where remaining practically 
stationary. Again I would remind the reader that our data for certain 
regions and occupations are insufficient to insure certainty regarding 
them. 

Only one occupational group was clearly increasing in the period 
when these college students were being born, viz., the farmers. One 
other group—the skilled workers—was just about holding its own. 
All other occupational groups (leaving out of account unskilled workers 
because of inadequate data) are below the reproduction line. Further- 
more only five of the nine groups of states, the West North Central, the 
South Atlantic, the East South Central, the West South Central, and 
the Mountain are above the reproduction level, and the margin is quite 
small in the West North Central and the Mountain states. 
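
The regional comparison in the paragraph above amounts to setting each division's total average against the reproduction line. The sketch below takes 4.0, the upper end of the 3.9 to 4 range given earlier, as that line, which is a choice made here for illustration. It uses the Table III totals for seven divisions; New England and the Middle Atlantic states, which the text places below the line, are left out of the sketch.

```python
# Upper end of the 3.9-4 children per fertile family suggested in the text.
REPRODUCTION_LINE = 4.0

# Total average children per family by division (Table III).
division_totals = {
    "East North Central": 3.86,
    "West North Central": 4.40,
    "South Atlantic": 5.51,
    "East South Central": 5.05,
    "West South Central": 5.05,
    "Mountain": 4.09,
    "Pacific": 3.94,
}

# Margin above (+) or below (-) the line for every division.
margins = {name: avg - REPRODUCTION_LINE
           for name, avg in division_totals.items()}
above_line = {name for name, m in margins.items() if m > 0}

for name, m in sorted(margins.items(), key=lambda item: item[1]):
    status = "above" if m > 0 else "below"
    print(f"{name:20s} {status} the line by {abs(m):.2f}")
```

The divisions that come out above the line are the five named in the text, with the Mountain states showing the narrowest margin.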

It also seems likely that it would take a slightly smaller average
number of children per fertile family to reproduce the agricultural class
than the professional, trade, and managerial groups, for a larger pro- 
portion of them marry and there are fewer childless couples among 
them. Thus we again find urbanization and industrialization most 
intense where there is a stationary or decreasing population in those 
groups which send children to college. Undoubtedly part of this 
difference between farmers and other groups studied here is due to the 
slower dissemination of information regarding birth control in the rural 
communities. But these and similar data clearly show that the effec- 
tive desire to have small families is greater where urbanized industry 
has attained its greatest development. 

We have no means of ascertaining with any degree of certainty the 
extent of the changes in the birth-rate in these different groups since 
the present generation of college students was born. It is reasonably 
certain, however, that the birth-rate has been declining everywhere. 
The result of this has quite likely been that some of the groups above 
the line of reproduction twenty to twenty-five years ago have fallen 
below that line today. Thus the professional, trade, and managerial 
groups in the South may now be failing to reproduce. It seems a safe 
assumption, however, that the farmers of the nation are reproducing 
themselves and adding a fair increment to the population year by year 
and that this increment is greatest in those parts of the country where 
urban life and industry are least developed. Such a study as this, 
based on data gathered from college students throws almost no light on 
the population situation among wage workers either native born or 
foreign born. 





COMPARISON OF CERTAIN TYPES OF COLLEGES 


In Table IV, data for certain types of colleges are compared. Both 
parents’ families and students’ families are given. It is interesting to 
note that a generation ago the families of those attending the two east- 
ern colleges were considerably smaller than of those attending the other 
institutions. This merely confirms other studies which have shown 
that the decline of the birth-rate began in the Northeast considerably 
earlier than in other parts of the nation. 

The rates of decline for these different types of institutions have not 
been uniform. The eastern colleges and Miami have gone down in 
almost exactly the same ratio (making the correction for the parents’ 
families of adding .5 of a child to the averages given here in all cases), 
the former 44.1 and the latter 44.7 per cent. In the Catholic institu- 
tions the decline has been 37 per cent, in Negro institutions 30.3 per 
cent, and in agricultural colleges 28 per cent. The fact that even a
generation ago the number of children in the families of those attending
the two eastern colleges was considerably smaller than that of the other 
groups merely adds evidence to the already well established fact that 
the decline in the birth-rate began in the Northeast and among those 
economically better circumstanced. 

It is interesting to see, too, that the Catholic students’ families are 
on the average larger (by about 1 child) than those of people of similar 
occupational groups in the same cities. This also agrees with Holmes’ 
study in California, where he found that families in which both parents 
were Catholics averaged .96 of a child larger than those in which both 
parents were Protestants—4.44 and 3.48 respectively. Our study 
seems to indicate that the economic status of the Catholic families 
averaged a little lower than that of the Protestant families. This is 
shown in part by the rather large proportion of Catholic students com- 
ing from homes of skilled laborers. Furthermore, a careful perusal and 
comparison of the original schedules leads one to believe that Catholics 
engaged in trade and in managerial work are, on the whole, in positions 
where the income is less than the average among Protestants. Our 
data are, however, not sufficiently extensive or detailed to enable us to 
arrive at definite conclusions regarding the relative importance of reli- 
gion and class status in determining the size of Catholic families. All 
one can safely say is that not all the difference between Protestant and 
Catholic families is due to the religious beliefs of the latter, for in addi- 
tion there is the difference in economic status and the question of na- 
tivity to consider, as it seems likely that there are more students having 
foreign-born parents among Catholics than among Protestants. 

The data from colleges having only Negro students are rather 
meager but they indicate a rapid decline in the birth-rate in this group 
as elsewhere, and also that the birth-rate is little, if any, larger than 
among the whites who send children to agricultural colleges. If this is 
the case it undoubtedly means a slower rate of natural increase among 
the Negroes than in this other group, because the Negroes have a high 
death-rate, especially among young children. 

There is little need to summarize the findings of this investigation 
here as this has practically been done in Table III. Also the general 
conclusions have been stated in the discussion of it. However, it may 
not be out of place to make one general statement: The people who send 
children to college cannot be considered a homogeneous group. Clearly 
the farmers must be differentiated from the other occupational groups. 
Furthermore, different sections of the country show considerable 
differences in the size of the families from which college students come. 
Consequently sweeping generalizations regarding this class cannot be
made. It is not true that the native stock in the educated classes is
failing to reproduce, at least in the South and parts of the West, although
in the Northeast and other parts of the West such is clearly the
case. More care, therefore, should be exercised in discussions regarding
the dying out of the upper cultural classes of native stock than has
been and is the custom among popular writers on this subject.


FAMILY SURVIVAL


It became clear in the course of tabulating the data for this study
that the similar environmental conditions under which relatively
homogeneous economic and social groups lived did not produce
uniformity in the number of children in these families. It seemed, on
the face of things, that the customary explanation of small families in 
the upper economic and social classes, namely, the economic and social 
disabilities suffered by people with larger families, could not be a com- 
pletely satisfactory explanation. 

It is obvious, when the family rather than the class is regarded as the 
reproductive unit, that the family with one or two children is not 
reproducing itself. Families with three children are reproducing them- 
selves with a slight increase and those with four or more are increasing 
quite rapidly (especially in the classes where the death-rate is very 
low). In the students’ families we found 27.5 per cent below the reproduction
level, while in the parents’ families we found only 9.9 per cent,
almost a three-fold increase of non-reproducing families in one genera- 
tion. Table V summarizes this information by occupations for the 
different sections of the country. 
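
The "almost three-fold" figure follows directly from the two percentages just quoted, as this small sketch shows. Only the two national percentages from the text are used; the function name is introduced here for illustration.

```python
def fold_increase(later_pct: float, earlier_pct: float) -> float:
    """How many times larger the later share is than the earlier one."""
    return later_pct / earlier_pct


students_below = 27.5  # per cent of students' families with 1 or 2 children
parents_below = 9.9    # per cent of parents' families with 1 or 2 children

ratio = fold_increase(students_below, parents_below)
print(f"non-reproducing families increased {ratio:.2f}-fold in one generation")
```

The same function can be applied row by row to Table V to compare the change in each division.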

In the different types of colleges for which we have data the percentages
of non-reproducing students’ families and parents’ families
are respectively, two eastern colleges 46.3 and 18.1; four agricultural
colleges (Middle West and South) 21.1 and 6.2; five Catholic colleges
and universities 27.9 and 7.5; three Negro colleges 24.3 and 10.01;
and Miami 39.1 and 11.5. There can be little doubt in the light of
these facts that getting far away from Mother Earth lessens the chance
of family survival. But, and this is a suggestion which cannot be
followed up here, it also seems not improbable that there is a factor in
survival of some families as compared with others which is not environmental.
Does heredity play a larger rôle in the selective processes
going on in society today than it is generally credited with? These
processes need more careful study than they have received hitherto.


TABLE V. PERCENTAGE OF FAMILIES HAVING 1 OR 2 CHILDREN

                      Students' families                                   Parents' families
                      All           Agricultural  Managerial,    Skilled   All
                      occupations                 trade,                   occupations
                                                  professional

New England           35.9          20.7          37.3           21.4      18.9
Middle Atlantic       36.2          25.2          38.3           34.9      14.1
East North Central    33.6          19.5          38.6           30.3       9.9
West North Central    25.2           9.6          32.3           33.9       7.8
South Atlantic        16.5           6.8          22.9           22.0       8.5
East South Central    18.8          10.3          25.1           19.2       7.2
West South Central    19.2           9.1          25.8           31.3       9.5
Mountain              31.9          14.5          39.1           33.8      11.8
Pacific               33.3          22.7          37.5           37.5      11.4
United States         27.5          12.4          33.7           30.1       9.9


VARIATION AND CORRELATION WITHIN THE
CREDIT STRUCTURE


By Helen D. Faulkner, Wheaton College.





I. OUTLINE OF THE PROBLEM


Existing studies of the statistics of the banking system have been concerned
almost exclusively with the analysis of aggregates for large groups
of banks and in particular with the fluctuations from time to time of
such aggregates.¹ A few special studies of individual banks have
indeed been made, but in these also emphasis is upon the time variations
in the condition of the individual institution.² In the present
inquiry, on the other hand, emphasis is upon the relative status of many 
individual institutions at a particular time. 

Data for such an investigation are available largely in a single source, 
the annual reports of the Comptroller of the Currency.³ The report
gives an abstract, itemizing the principal constituents of the resources 
and liabilities, of the balance sheet of each member of the national 
banking system at the September date-of-call in the year covered by 
the report. Similar data might be compiled, from reports of state 
banking commissioners and from miscellaneous sources, for individual 
banks not members of the national banking system. The danger that 
such data would not be strictly comparable with the figures for the 
national banks, and the special statutory restrictions upon the na- 
tional banks constitute sufficient reasons for confining attention to the 
Comptroller’s figures. The material of this analysis is drawn almost 
entirely therefore from the official figures for national banks. 

At any given time, for example at a particular September date-of-call, 
the credit structure as a whole (as exemplified by the national banking 
system) can be described by exhibiting each principal item of the bal- 
ance sheet for each bank. The extent and complexity of the tabulated 
presentation of all these details would prevent any satisfactory or useful 
interpretation except by means of the technical summarizations 
afforded by modern statistical method. 

1 Professor E. W. Kemmerer, Seasonal variations in the relative demand for money and capital in the
United States, National Monetary Commission, 1909; Professor Warren M. Persons, Review of Economic
Statistics, 1921 and 1924; Professor Allyn A. Young, Review of Economic Statistics, 1924 and 1925.

2 Miss A. P. Youngman, Federal Reserve Bulletin, 1921; Professors William L. Crum and Homer B.
Vanderblue, Harvard Business Review, 1925.

3 In a table generally entitled: “Condensed reports of the resources and liabilities of national banks
at the close of business. . . .” The exact form of the tabulation has undergone some change during the
interval for which the reports are available, but the alterations have not been serious in the items used
in this analysis for the period covered by the study.

The item for loans and discounts, for example, differs for different
banks. A graphic picture of this condition is furnished by the frequency
diagram which shows the number of banks in the group for
which the loans and discounts item falls within specified limits (See 
Chart 1). For the entire system of banks the item for loans and dis- 
counts can be summarized by stating the mean and the dispersion (or 
variability). The same process can be applied to each item in the uni- 
form bank statement. 
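
A minimal sketch of the summary just described, using Python's standard library. The loan figures and the class width are hypothetical values invented for illustration; they are not taken from the Comptroller's reports. The dispersion is computed here as the standard deviation, which is one common choice and may not be the exact measure the study employs.

```python
import statistics
from collections import Counter

# HYPOTHETICAL loans-and-discounts items for twelve banks, in thousands
# of dollars. These are illustrative values, not data from the study.
loans = [120, 185, 240, 260, 310, 330, 345, 410, 455, 520, 610, 780]

CLASS_WIDTH = 100  # assumed width of each class interval


def frequency_table(values, width):
    """Count how many values fall in each interval [k*width, (k+1)*width)."""
    counts = Counter((v // width) * width for v in values)
    return dict(sorted(counts.items()))


table = frequency_table(loans, CLASS_WIDTH)
mean = statistics.mean(loans)
dispersion = statistics.pstdev(loans)  # standard deviation of the group

for lower, count in table.items():
    print(f"{lower:4d}-{lower + CLASS_WIDTH - 1:4d}: {'#' * count}")
print(f"mean = {mean:.1f}, standard deviation = {dispersion:.1f}")
```

The printed rows of marks are a crude frequency diagram; the two numbers at the end are the summary that replaces the full list of banks.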

CHART 1. Frequency curve of the loans item for Illinois (F. R. District 7) selected smaller
banks at the September date-of-call in 1919.

These steps yield, for each item of the balance sheet, a statistical
description which is quite independent of the corresponding facts for
the other items. A different aspect of the problem is implied in the
average relation of one item to another or to several others. The existence
of such an average relation between items of the bank statements
is established statistically by measuring the correlation between
the items, and the nature of that relation is indicated by the line (or
curve) of regression. The measures of correlation and regression constitute
additional summary facts, which supplement the knowledge
furnished by the means and variabilities of the individual series for each
item. 
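
The two additional measures can be sketched as follows for a pair of balance-sheet items. The deposit and loan figures are hypothetical, and the straight least-squares line of loans on deposits is assumed here only as the simplest case of a line of regression.

```python
import math


def mean(values):
    return sum(values) / len(values)


def correlation_and_regression(x, y):
    """Return (r, slope, intercept) for the least-squares line of y on x."""
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    r = sxy / math.sqrt(sxx * syy)   # coefficient of correlation
    slope = sxy / sxx                # regression coefficient of y on x
    intercept = my - slope * mx      # line passes through the two means
    return r, slope, intercept


# HYPOTHETICAL items for five banks, in thousands of dollars.
deposits = [200, 300, 400, 500, 600]
loans = [150, 210, 260, 340, 390]

r, slope, intercept = correlation_and_regression(deposits, loans)
print(f"r = {r:.3f}")
print(f"loans = {intercept:.1f} + {slope:.2f} * deposits")
```

The coefficient of correlation establishes whether an average relation exists; the slope and intercept describe what that relation is.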

The inquiry divides naturally into four sections. The first section is 
a study of selected smaller banks, banks which are not only “country 
banks” according to the accepted definition but are clearly concerned
only in slight degree with large financial operations; and the second
section covers selected larger institutions, whether in reserve cities or
elsewhere, which are clearly accustomed to handle large institutional
accounts. The third section undertakes to trace cyclical and long-time
changes in the summary measures—means, variabilities, correlations, 
regressions—over a long period of years; and the fourth section under- 
takes to apply these findings to the current situation by the use of 
figures originating in the Federal Reserve System. The present paper 
covers only the first section, on the selected smaller banks. 


II. THE SMALLER BANKS COVERED BY THE STUDY


In making a study of the statistics for the smaller banks, an objective 
definition of “smaller” was needed. A detailed examination of the
individual balance sheets, to ascertain for precisely which banks a 
considerable fraction of the aggregate liabilities was in the form of 
bankers’ deposits, might have been made; but such a survey would 
involve an undue expenditure of time and effort. The “country 
banks” might have been selected, on the ground that institutions not 
located in the reserve or central reserve cities would generally be free 
from those influences ordinarily affecting the operations of “bankers’
banks.” Chart 2 shows the portion of total liabilities included under “due
to banks,” “demand deposits,” “time deposits,” and miscellaneous
items, in the aggregates for the country banks and for the reserve city
or central reserve city banks of selected states at the September date-of-call
in 1919. An outstanding feature of the chart is that the item
“due to banks” is invariably larger, and usually much larger, for the
reserve city or central reserve city banks than for the country banks.
Corresponding tabular data for other years and for other states serve
to confirm this conclusion. Many country banks, however, carry large
bankers’ accounts, particularly for state and private banks and bankers;
and indeed mere financial strength, rather than location in a specified
city, appears to be the condition of the holding of large institutional
accounts.

CHART 2. Percentage distribution of the liabilities, for country banks and for reserve city or
central reserve city banks, in selected states at the September date-of-call in 1919.
(Upper bar in each case is for the country banks. * Lower bar for central reserve
city banks. † No reserve city or central reserve city banks.)

It seemed appropriate, therefore, to select as smaller banks those 
having aggregate resources below a specified value. This value was 
taken uniformly at one million dollars, except for banks in the north- 
eastern region of the United States where it was set at two million 
dollars. The generally larger scale of financial operations in the north- 
eastern region seemed to justify this difference in the criterion: the 
institution with aggregate resources between one million and two 
million dollars might well be “large” in the west and south and yet be 
“small” in the northeast. The difference in criterion therefore tends 
to insure, very roughly, an equally inclusive selection of the smaller 
banks in all regions; but, as will be noted later, it is an obstacle to 
effective comparison of some of the results for the several regions. 

One further point should be remarked: a particular bank may be a 
smaller bank, in the sense of the definition, in one year and not in 
another year. Thus the banks included in the group are not identical 
from year to year. For the purpose of the study this practice is not 
objectionable: on the contrary, the continued inclusion of a bank which 
had grown beyond the limit—perhaps by the process of consolidation— 
would tend to break down the homogeneity of the selected group. 

As the compilation and analysis of complete data for all (approxi- 
mately 8,000, since the war) of the institutions which came under this 
definition of smaller banks at a particular date-of-call would require 
very long and laborious statistical operations, the study was limited to a 
sample, numbering about 2,500, chosen from among all the smaller 
banks. The discussion which follows (Section IV) indicates that in the 
present problem there is so great lack of homogeneity among the con- 
stituents of the entire group that any adequately representative sample 
for the entire United States can scarcely be chosen. Because of the 
great differences in the data for the various geographical sections of the 
country it appears fitting to give chief attention to conditions in specific 
regions rather than in the country as a whole. No thorough attempt is
made, therefore, to secure a representative sample for the entire banking
system. Although some study is made of each of the more important
geographical divisions, and the survey is in that sense comprehensive,
it is not believed that the data analyzed can be combined to
afford any satisfactory picture for the United States. The emphasis is
upon the sections rather than upon the whole.

The sample actually chosen included banks with aggregate resources 
not exceeding one million dollars in California, Colorado, Georgia, 
Illinois (that portion in Federal Reserve District 7), Kansas, Minnesota, 
Montana, Nebraska, South Dakota, Texas, and Virginia, and with 
aggregate resources not exceeding two million dollars in Massachusetts, 
New York, and Ohio. The data for loans, discounts and overdrafts— 
hereafter called loans—and for demand deposits (including United 
States deposits)—hereafter called deposits—were compiled for each 
bank in the sample at each September date-of-call in the years 1919-23. 
These data are the material of the main analysis. 


III. THE STATISTICAL ANALYSIS 


The steps in the statistical study were identical for all fourteen states
and for all five dates-of-call, and will therefore be described only for a
single case, that of Illinois (Federal Reserve District 7) in 1919.
Frequency series were formed from the data for the individual banks, and
appear in Table I. It should be remarked that the width of the
class-interval is uniformly $50,000 throughout Column 1, except that the
lowest interval ranges from 0 to 25,000. In the following calculations
it will be assumed that every bank within a particular $50,000 interval
falls exactly at the center of that interval (that all the banks having
loans between $175,000 and $225,000 have loans of exactly $200,000)
and that all the banks in the lowest interval fall exactly at zero. Both
of these assumptions involve some error; but, for the purposes of the
present analysis, these errors are not significant. With these
assumptions, Charts 1 and 3 exhibit graphically the series of Table I.

CHART 3

Frequency curve of the deposits item for Illinois (F. R. District 7) selected smaller
banks at the September date-of-call in 1919. (Axes: frequency; deposits, unit $100,000.)


TABLE I


FREQUENCY DISTRIBUTIONS OF THE SELECTED SMALLER BANKS OF ILLINOIS
(DISTRICT 7) IN 1919, CLASSIFIED ACCORDING TO THE SIZE OF THE LOANS
ITEM AND OF THE DEPOSITS ITEM


                                             Number of banks having
Amount in thousands   Amount in unit    Loans of            Deposits of
of dollars            of $50,000        specified amount    specified amount
        (1)                (2)                (3)                 (4)

       0-25                 0                  0                   2
      25-75                 1                  4                   3
     75-125                 2                 20                  19
    125-175                 3                 26                  32
    175-225                 4                 23                  29
    225-275                 5                 28                  25
    275-325                 6                 23                  20
    325-375                 7                 13                   9
    375-425                 8                 17                  20
    425-475                 9                 12                  11
    475-525                10                  7                   8
    525-575                11                  4                   1
    575-625                12                  3                   1
    625-675                13                  2                   3
    675-725                14                  2                   0
    725-775                15                  0                   1

    Total                                    184                 184

One operation in the analysis of the two series of Table I consists in 
finding the arithmetic mean for each series, and these are found to be 
280 (thousand dollars) for loans and 266 for deposits. From the point 
of view of this paper an equally significant characteristic of the fre- 
quency series is the dispersion. Charts 1 and 3 indicate the strong tend- 
ency for the loans (or deposits) of individual banks to deviate from the 
mean, and a measure of such deviation is the standard deviation. 
Computation yields 142 (thousand dollars) as the standard deviation 
for loans and 138 as that for deposits. Although the standard deviation
has much significance in the description of the frequency distribution
of a given series, comparison of the dispersion of such series with
that for other series is facilitated by using the coefficient of variation, 
which is the per cent ratio of the standard deviation to the mean. 
Thus, for the series of Table I the coefficients of variation are 51 per cent 
(loans) and 52 per cent (deposits). 
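
The calculation just described can be sketched in a few lines. The block below is an illustration only, not the author's worksheet: it takes the class centers and frequencies as read from Table I, assumes the deposits frequency in the 8-unit class is 20 (the value that makes the column add to the printed total of 184), and divides by N rather than N - 1 in the standard deviation. The function and variable names are invented for the example.

```python
from math import sqrt

# Class centers in units of $50,000 (Column 2 of Table I) and the frequencies
# as read from the table. ASSUMPTION: the deposits entry for the 8-unit class
# is taken as 20 so that the column adds to the printed total of 184.
UNITS = list(range(16))
LOANS = [0, 4, 20, 26, 23, 28, 23, 13, 17, 12, 7, 4, 3, 2, 2, 0]
DEPOSITS = [2, 3, 19, 32, 29, 25, 20, 9, 20, 11, 8, 1, 1, 3, 0, 1]
UNIT_VALUE = 50  # thousands of dollars per unit


def grouped_summary(units, freqs, unit_value=UNIT_VALUE):
    """Return (mean, standard deviation, coefficient of variation in per cent).

    Every bank is placed at the center of its class, as the text assumes.
    The standard deviation divides by N (an assumption of this sketch).
    """
    n = sum(freqs)
    mean_u = sum(u * f for u, f in zip(units, freqs)) / n
    var_u = sum(f * (u - mean_u) ** 2 for u, f in zip(units, freqs)) / n
    mean = mean_u * unit_value
    sd = sqrt(var_u) * unit_value
    return mean, sd, 100 * sd / mean


if __name__ == "__main__":
    for name, freqs in (("loans", LOANS), ("deposits", DEPOSITS)):
        mean, sd, cv = grouped_summary(UNITS, freqs)
        print(f"{name}: mean={mean:.1f}  sd={sd:.1f}  cv={cv:.1f}%")
```

Under these assumptions the results agree with the figures quoted in the text to within one thousand dollars for the means and standard deviations, and to the nearest per cent for the coefficients of variation.
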

The mean and the coefficient of variation are the chief descriptive
numbers of the frequency series, for the present purpose. Although
many of the frequency series of this problem possess skewness, and
some of them in marked degree, the series are so irregular that a
numerical measurement of skewness is scarcely valid. Moreover, as a careful
comparative study of skewness is not essential to the discussion, the
unavailability of an adequate numerical measure of skewness constitutes
no obstacle to the analysis. A rough estimate of skewness of a
series can be formed by inspection of the plotted frequency curve,
and such estimate is quite sufficient in the present instance. The
arithmetical calculations based upon Table I are therefore confined to
the determination of the mean and the coefficient of variation.

TABLE II

CORRELATION TABLE, BETWEEN LOANS AND DEPOSITS, FOR THE SELECTED
SMALLER BANKS OF ILLINOIS (DISTRICT 7) IN 1919

The 1919 data for Illinois were organized also in the double frequency
series shown in Table II, which is the correlation table. If the means of
all the vertical arrays of Table II are found and plotted they tend to lie
along a line, the line of regression (Chart 4).¹ The line of regression
of deposits on loans affords a method of estimating the deposits item
when the loans item is stated. The regression coefficient of deposits
on loans gives the number of units (of $50,000) which deposits tend to
rise as loans rise by one unit.

CHART 4

Means of the deposit-arrays (vertical arrays), and line of regression of deposits
on loans for the selected smaller banks in Illinois (F. R. District 7) at the September
date-of-call in 1919.

It should be noted that the line of regression must be replaced by a
curve of regression in many problems; but, for the present study, errors
involved in assuming that regression is in all cases linear are not serious.
Moreover, the coefficient of correlation is most satisfactory as a measure
of relationship only in cases where regression is linear; but, as linear
regression holds approximately for all cases in this study, this point
is of no moment in the present problem. For some correlation tables
not only the means of the arrays but also the individual frequencies
cluster more closely along the line of regression than for other
correlation tables. Where the clustering is close the regression line yields an
estimate which is highly typical, and where clustering is less close the
estimate is affected with a larger probable error. Ordinarily a good
impression of the closeness of cluster can be obtained by examining the
correlation table; and this is one of the chief reasons for constructing
such a table, even in cases where the coefficients can conveniently be
calculated without forming the table.

¹ There is some evidence that the clustering is along a curve which is concave downward; but such
evidence is not conclusive, particularly if each plotted point is weighted by the total frequency of the
corresponding array. (Indicated on the chart by the numerals.)

The basic statistical results upon which the discussion will rest are 
those described above; the means and coefficients of variation for loans 
and for deposits, the coefficients of correlation between loans and de- 
posits, and the coefficients of regression of deposits on loans. These 
six numbers, for each of the fourteen states for each year, appear in 
Tables III-V. 
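
As an illustration of how the correlation and regression coefficients come out of a correlation table, the following sketch works from a small table of cell frequencies. The table itself is hypothetical (the body of Table II is not reproduced here), and the function names are invented for the example; the formulas are the ordinary product-moment ones, with the regression of deposits (y) on loans (x).

```python
from math import sqrt

# HYPOTHETICAL correlation table, not the data of Table II:
# {(loans class in units, deposits class in units): number of banks}.
TABLE = {(1, 1): 2, (2, 1): 1, (2, 2): 3, (3, 2): 1, (3, 3): 2}


def array_means(table):
    """Mean deposits within each vertical array (each loans class)."""
    totals, counts = {}, {}
    for (x, y), f in table.items():
        totals[x] = totals.get(x, 0) + y * f
        counts[x] = counts.get(x, 0) + f
    return {x: totals[x] / counts[x] for x in sorted(totals)}


def correlation_and_regression(table):
    """Return (r, b): correlation coefficient and regression of y on x."""
    n = sum(table.values())
    mx = sum(x * f for (x, _), f in table.items()) / n
    my = sum(y * f for (_, y), f in table.items()) / n
    sxx = sum(f * (x - mx) ** 2 for (x, _), f in table.items()) / n
    syy = sum(f * (y - my) ** 2 for (_, y), f in table.items()) / n
    sxy = sum(f * (x - mx) * (y - my) for (x, y), f in table.items()) / n
    r = sxy / sqrt(sxx * syy)  # product-moment correlation
    b = sxy / sxx              # units of y per unit of x
    return r, b


if __name__ == "__main__":
    print(array_means(TABLE))
    print(correlation_and_regression(TABLE))
```

The two coefficients are tied together by b = r × (standard deviation of deposits) / (standard deviation of loans). As a rough check on the Illinois figures for 1919, 0.57 × 138 / 142 is about 0.55, which agrees with the regression coefficient of 56 per cent in Table V to within the rounding of the inputs.
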


IV. INTERPRETATION OF THE RESULTS 


TABLE III


MEAN VALUE OF LOANS AND DEPOSITS ITEM FOR SELECTED SMALLER BANKS AT
SEPTEMBER DATE-OF-CALL


(Unit: $1,000)


                          (a) Loans              |          (b) Deposits
                 1919  1920  1921  1922  1923  |  1919  1920  1921  1922  1923

Massachusetts     473   528   534   539   523  |   522   521   496   520   536
New York          333   401   394   365   406  |   397   389   358   352   358
Ohio              387   438   417   411   433  |   457   454   387   367   380
Illinois (7)      280   322   304   300   307  |   266   242   203   200   206
Virginia          326   320   326   350   356  |   235   223   182   168   158
Georgia           334   376   290   304   291  |   232   209   245   167   157
Texas             274   316   256   265   261  |   246   261   206   232   251
Kansas            293   302   258   270   272  |   269   260   221   219   220
Nebraska          ...   406   322   347   349  |   265   235   184   203   191
Minnesota         302   353   323   317   316  |   162   159   121   121   122
South Dakota      333   375   330   350   345  |   202   162   146   144   140
Montana           210   227   187   217   192  |   138   109   105    97   106
Colorado          314   354   266   280   269  |   287   261   212   207   192
California        ...   288   261   261   273  |   299   294   258   263   263

(... = entry illegible in the source.)

Examination of Table III reveals certain significant facts about the 
average size of the loans and deposits items. The movement 1919-20 
was especially marked in loans: the mean for every state except Virginia 
rose, and several of them rose very sharply. For deposits the move- 
ment was less wide and was generally in the opposite direction: for 
every state except Texas mean deposits declined 1919-20. During the 
next year 1920-21 the direction of movement for loans shifted: for all 
states except Massachusetts and Virginia mean loans declined. De- 
posits, on the other hand, continued the change begun in the previous 
year; mean deposits declined from 1920-21 for each of the fourteen 
states. The changes for both mean loans and mean deposits were less 
uniform 1921-22, although there is a clear preponderance of increases in 
mean loans and decreases in mean deposits. For both series the year 
1922-23 was one of almost complete stagnation—such changes as oc- 
curred were too slight to be significant, particularly as there was no 
uniformity in direction of movement among the several states. 

Comparison of the conditions in the several states at any one date (for 
example, comparison of the items for 1919 in Table III) shows very 
great lack of uniformity. It should be noted that such comparisons 
must be confined within two groups, the first three states (of the north- 
eastern region) and the eleven other states. Even with this reservation 
the conclusion stands: there is great diversity in the mean loans of the 
various states of each of the two groups in 1919. The same observation 
holds for the other years, and likewise for mean deposits in each of the 
five years. 

Table III suggests therefore that the direction of movement within 
the several states is highly uniform in a period, such as 1919-21, of 
brisk cyclical changes in business and credit conditions, that at other 
times the direction of movement is likely to be quite uncertain, and 
that at any given time wide diversity exists among the means for the 
several states. 

The ratios of mean loans to mean deposits show the peculiarities 
found in Table III and indeed somewhat emphasize them. The 1920 
ratios in particular exhibit a uniformly upward movement from 1919, 
and at the same time bring into relief the extreme difference among the 
states. The range in 1920 from less than 100 per cent in Ohio and Cali- 
fornia to over 200 per cent in Minnesota, South Dakota and Montana 
suggests a vast difference between the states in respect of credit condi- 
tions and banking policy. That mean deposits should so nearly equal 
mean loans in some regions and yet fall far short of mean loans in other 
regions is one of the most striking facts revealed by the data. 
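
The ratios cited in this paragraph can be recomputed from the 1920 columns of Table III. The sketch below does so for the five states named; it assumes the row labels of the table follow the same order of states as Table V (several labels are illegible in the scan), and the names `MEANS_1920` and `loan_deposit_ratio` are invented for the example.

```python
# Mean loans and mean deposits for 1920, in $1,000, as read from Table III.
# ASSUMPTION: rows are attributed to states by the order used in Table V.
MEANS_1920 = {
    "Ohio": (438, 454),
    "California": (288, 294),
    "Minnesota": (353, 159),
    "South Dakota": (375, 162),
    "Montana": (227, 109),
}


def loan_deposit_ratio(loans, deposits):
    """Mean loans as a percentage of mean deposits."""
    return 100.0 * loans / deposits


RATIOS = {state: loan_deposit_ratio(*pair) for state, pair in MEANS_1920.items()}

if __name__ == "__main__":
    for state, ratio in RATIOS.items():
        print(f"{state:<13} {ratio:6.1f} per cent")
```

On this reading the two low states fall a little below 100 per cent and the three high states above 200 per cent, as the text states.
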

Further evidence of these conditions is found in the figures for country
banks, which are published in summary form in the Comptroller’s
reports.¹ The country banks are more numerous than the selected
smaller banks, and, as noted above, they form a less homogeneous group
than the selected smaller banks.² The observations made on the ratios
for selected banks are confirmed by those for country banks. There
are, however, many differences in the magnitude of change and even
some differences in direction. On the whole, the country banks had
the same experience as the selected smaller banks.

¹ In a table usually entitled: “Abstract of the state and condition of the national banks at
² The country banks for Illinois include the entire state, whereas the selected banks are for District 7
only. This difference is not significant in the discussion. The item “deposits” does not include U. S.
deposits for the country banks, and direct comparisons between country banks and selected banks are
not valid.

506 American Statistical Association [40


Analysis of the country bank data from another point of view shows 
that time deposits constitute a much larger part of aggregate liabilities 
for such states as Minnesota and South Dakota than for Massachusetts 
and California.¹ This fact bears especially on the wide differences, 
remarked above, in the relation between mean loans and mean de- 
posits for the several regions. 


TABLE IV


COEFFICIENT OF VARIATION, FOR LOANS AND DEPOSITS ITEM OF THE
SELECTED SMALLER BANKS


(Unit: 1 per cent)


                          (a) Loans              |          (b) Deposits
                 1919  1920  1921  1922  1923  |  1919  1920  1921  1922  1923

Massachusetts      60    60    58    58    54  |    54    59    55    62    57
New York           72    63    64    67    64  |    73    75    75    77    73
Ohio               67    66    68    73    69  |    67    68    72    72    69
Illinois (7)       51    49    54    53    53  |    52    51    62    65    60
Virginia           48    49    50    50    46  |    58    49    63    68    69
Georgia*           39 5? 52 52 49 55 65 58 69
Texas              54    52    6?    58    57  |    58    59    63    58    57
Kansas*            49 52 58 55 54 53 59 57 60
Nebraska*          38 43 44 45 43 46 51 49 46
Minnesota          46    46    47    51    50  |    54    46    56    58    56
South Dakota       43    51    54    49    50  |    48    54    65    65    60
Montana            79    68    78    68    76  |    89    83    ?2    68    72
Colorado           47    52    53    70    55  |    43    52    59    57    59
California         61    61    59    57    60  |    53    55    65    60    60

(? = digit illegible in the source. * = only nine of the ten entries are legible;
they are given in printed order and their column positions are uncertain.)

Table IV, showing the coefficients of variation for loans and for de- 
posits, gives evidence of great lack of uniformity among the states at 
any one time, and considerable lack of constancy for any one state from 
time to time. The difference between these coefficients is especially 
wide for the different states at any one time. The fact that the coeffi- 
cient ranges from about 40 per cent to about 80 per cent for the fourteen 
states in a given year is evidence of important differences among the 
states in respect of the typicalness of the mean: the smaller the coefficient 
the more typical, in general, is the mean. 

Moreover, fluctuations in time are quite irregular, and in general
without apparent rule. Exceptions to this lack of constancy in time
appear, as for example, in California and Massachusetts. In most
cases, however, instability is notable: the tendency of the coefficient to
suffer sharp changes from year to year is an all but general feature of
these tables. The concrete meaning of this is that, whatever be the
movements in time of mean loans (or deposits) for the group of banks
in a given state, the representativeness of that mean is likely to
fluctuate quite extensively.

¹ It would be possible to carry out a similar analysis for the selected banks, but the labor of compilation
was deemed too large to justify making the test. It is doubtful if any different conclusion would be
reached than was found for the country bank data. See also the presentation in Chart 2.

These chronological changes in the coefficients are not haphazard 
throughout the record. Although there is no clear tendency of the 
coefficients to decline or rise, either for loans or for deposits, in the years 
1919-20, 1921-22, 1922-23, there is an unmistakable uniformity in 
the changes 1920-21. For both loans and deposits there is a heavy 
increase in the coefficient for nearly every state. This evidently indi- 
cates that, in the year of deep depression following the crisis of 1920, 
the conditions of the individual banks within each state became more 
widely dispersed from the mean condition for that state than had pre- 
viously been the case. Although some of this increase had disap- 
peared by 1923, the coefficients had not declined in most states even 
nearly to the 1919 values. Abnormal dispersion persisted into 1923. 


TABLE V


COEFFICIENT OF CORRELATION AND COEFFICIENT OF REGRESSION BETWEEN
DEPOSITS AND LOANS, FOR SELECTED SMALLER BANKS


(Unit: 1 per cent)


                 (a) Coefficient of Correlation  |  (b) Coefficient of Regression
                 1919  1920  1921  1922  1923  |  1919  1920  1921  1922  1923

Massachusetts      80    76    7?    78    72  |    80    74    65    80    77
New York           67    67    67    60    60  |    80    77    71    66    61
Ohio*              65 72 78 76 89 70 71 69 67
Illinois (7)       57    66    71    70    65  |    56    52    54    58    50
Virginia           59    57    69    53    51  |    51    40    49    35    34
Georgia            67    73    75    58    58  |    55    58    45    35    41
Texas              74    71    83    70    71  |    72    66    71    63    68
Kansas             70    83    84    83    76  |    71    77    82    67    68
Nebraska           68    64    67    72    67  |    53    45    44    46    38
Minnesota          57    60    68    55    52  |    36    27    31    24    22
South Dakota       72    78    80    73    72  |    48    36    42    40    35
Montana            89    84    88    84    81  |    65    49    52    37    43
Colorado           70    75    76    88    75  |    58    55    67    92    58
California         69    71    72    71    70  |    68    66    78    75    67

(? = digit illegible in the source. * = only nine of the ten entries are legible;
they are given in printed order and their column positions are uncertain.)

Table V shows the coefficients of correlation and regression; and, on 
the whole, there is greater constancy from year to year than was found 
for the other summary numbers. At a given time, however, the differences
between states are quite marked; and this is especially true for the 
regression coefficients. 

The significance of this diversity can be appreciated only if the mean- 
ing of the line of regression is kept in mind. The inclination of the line 
of regression indicates the extent of the tendency for differences in
deposits to equal differences in loans, as between different banks. The 
line of regression compares in effect the relation between loans and de- 
posits for larger banks with that relation for smaller banks. In other 
words, larger banks have on the average a higher loans item and a 
higher deposits item than smaller banks; and the inclination of the line 
depends upon the maintenance, between banks of all sizes, of a sub- 
stantially uniform relationship of loans to deposits. If the line is 
inclined at 45°, a difference of one unit in loans tends to be accompanied 
by a difference of one unit in deposits. If the inclination is less than 45°, 
the difference in deposits is less than the difference in loans. Thus, the 
less steep the inclination of the line of regression, the less fully does 
the difference between the deposits of two banks amount, on the aver- 
age, to the difference between the loans. 
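
A short sketch may make the reading of the regression coefficient concrete. It uses the Illinois figures for 1919 quoted in the text and tables (mean loans 280, mean deposits 266, regression coefficient 56 per cent) and assumes, as for any least-squares line, that the line of regression passes through the mean-point. The function name is invented for the example.

```python
def estimate_deposits(loans, mean_loans, mean_deposits, b):
    """Deposits read off the regression line of deposits on loans.

    ASSUMPTION: the line passes through (mean_loans, mean_deposits);
    b is the regression coefficient as a fraction (0.56 for 56 per cent).
    """
    return mean_deposits + b * (loans - mean_loans)


if __name__ == "__main__":
    # Illinois (District 7), 1919, amounts in $1,000.
    small = estimate_deposits(300, 280, 266, 0.56)
    large = estimate_deposits(400, 280, 266, 0.56)
    # Two banks whose loans differ by 100 are estimated to differ
    # in deposits by about 56, which is less than the difference in loans.
    print(small, large, large - small)
```

With a coefficient of 100 per cent (a line inclined at 45°) the two differences would be equal; with 56 per cent the estimated difference in deposits is only a little over half the difference in loans.
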

Differences in the inclination of the line of regression for the several 
states at any one date-of-call signify, then, differences in the responsive- 
ness of the deposits item to changes in the loans item. That the co- 
efficient of regression can range from under 40 per cent to over 80 per 
cent is indicative of marked regional differences in banking policy. 
The extremely low coefficients for Minnesota are explicable in part 
by reference to the data on time deposits; but, even when Minnesota is 
eliminated, there remains a wide range of variation in each year. None 
of the summary numbers, even the mean deposits, shows greater re- 
gional non-uniformity than does the regression coefficient. This lack of 
uniformity suggests that there is no nation-wide banking policy in the 
matter of maintaining a relationship between loans and deposits or, if 
there is such a policy, regional differences in credit conditions are 
effective obstacles to its maintenance. 

Differences in the regression coefficient for one state at different 
dates-of-call carry a more subtle implication. Chart 5 shows the 
principal relative positions of the lines of regression for a state as they 
may appear in different years. Situations (a), (b) and (c) may arise 
if the mean-point (located by using the results for mean loans and 
mean deposits) for the later year clearly lies below the regression line 
for the earlier year; and situations (d), (e) and (f) may arise if the 
mean-point for the later year clearly lies above the regression line for 
the earlier year. At (a) and (f) the lines are divergent: the mean 
absolute (disregarding plus and minus signs) change in deposits de- 
pends upon the size of the loans item, and is larger according as the 
loans item is larger. At (b) and (e) the lines are parallel (strictly, a 
case is classified here if the two regression coefficients do not differ by 
more than 2 per cent): the mean absolute change in deposits is a flat 
change; it is approximately the same whatever the size of the loans 
item. At (c) and (d) the lines are convergent: the mean absolute 
change in deposits depends upon the size of the loans item, and is 
smaller according as the loans item is larger. 
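
The classification just described can be written out as a small rule. The sketch below is one reading of the text, not the author's procedure: it assumes the earlier line passes through the earlier mean-point, treats the 2 per cent tolerance as two points of the coefficient as printed in Table V, and makes no provision for the doubtful cases in which the later mean-point does not clearly lie on one side of the earlier line. The function name and argument layout are invented for the example.

```python
def classify_shift(earlier, later, tolerance=2.0):
    """Return the type of shift, 'a' to 'f', of the regression line.

    earlier, later: (mean_loans, mean_deposits, regression_coefficient_pct).
    ASSUMPTIONS of this sketch: the earlier line passes through its own
    mean-point, and lines whose coefficients differ by no more than
    `tolerance` points are treated as parallel.
    """
    l0, d0, b0 = earlier
    l1, d1, b1 = later
    # Height of the earlier regression line at the later year's mean loans.
    line_at_l1 = d0 + (b0 / 100.0) * (l1 - l0)
    below = d1 < line_at_l1
    if abs(b1 - b0) <= tolerance:
        return "b" if below else "e"
    later_flatter = b1 < b0
    if below:
        # Later line lies below: a flatter later line widens the gap as
        # loans rise (divergent), a steeper one narrows it (convergent).
        return "a" if later_flatter else "c"
    # Later line lies above: a steeper later line widens the gap.
    return "d" if later_flatter else "f"


if __name__ == "__main__":
    # Illinois (District 7), from Tables III and V.
    print(classify_shift((280, 266, 56), (322, 242, 52)))  # 1919 to 1920
    print(classify_shift((322, 242, 52), (304, 203, 54)))  # 1920 to 1921
```

On the Illinois figures this rule gives type (a) for 1919-20 and type (b) for 1920-21.
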

In a stable credit situation types (a) and (f), if changes were only
moderate, would be regarded as normal: a rise (or fall) in the mean
deposits would be expected to be larger for banks having larger loans
(and therefore also, on the average, larger deposits). Types (b) and
(e) represent a development in which small and large banks tend to
suffer equal changes in the deposits item, and types (c) and (d)
represent the still more abnormal development in which smaller banks
actually suffer greater absolute change than larger banks. Under
what circumstances might these abnormal movements (b), (c), (d)
and (e) be expected to arise? Types (b) and (c) might conceivably
appear during a period of prolonged liquidation, when there may
exist, on the one hand, a greater tendency for the commercial customers
of the smaller banks to draw down deposits while clamoring for
extension of loans, and on the other hand a greater tendency for the
deposits of the larger banks to accumulate in consequence of the
stoppage of industrial activity. Types (d) and (e) might conceivably
appear during a period of brisk business improvement, in which larger
institutions may tend to expand their loans with somewhat greater
liberality in respect to deposit requirements than smaller institutions.
Finally, more than moderate movement of type (a) or (f) is the normal
accompaniment of credit strain and crisis: in a period of excessive
business activity and credit expansion, the larger banks are likely to
be the first to contract loans or to resist drawing down of deposits
(type f), and in a period of acute financial crisis the larger and more
powerful institutions may well be able to renew loans to meet distress
needs with greater ease than the smaller institutions.

CHART 5

Type of shift of the regression line, from year to year. (Earlier year: solid line.
Later year: broken line.)


TABLE VI 


TYPE OF SHIFT OF THE REGRESSION LINE, FROM YEAR TO YEAR; WITH NUMBER
OF CASES OF EACH TYPE IN EACH YEAR

The evidence of Table VI, making liberal allowance for lack of precision
in the results and ignoring doubtful cases, tends to substantiate 
this view. The movement 1919-20 was a severe shift—by no 
means moderate—of type (a) in all states but three: it was a year in 
which credit crisis occurred. Eight states showed movements of 
types (c) and (b) in 1920-21, whereas none showed types (d) or (e). 
On the other hand, six states experienced movements of types (d) and 
(e) in 1921-22, whereas only three showed types (c) and (b). The 
changes of 1922-23 were quite diverse and this is in keeping with the 
generally accepted fact that there were marked regional differences in 
business and credit conditions during that year. 

In general conclusion, the first summary statement must emphasize 
the marked, and in some respects scarcely explicable, regional and 
chronological differences in each of the six measures and coefficients 
studied. Variation—and wide variation—is the rule within the credit 
structure. In the second place it has been found that, despite these 
great diversities, there is an evident consistency in many of the
individual variations, particularly those which take place in time. If the 
values of the means, variabilities and regression coefficients differ 
markedly from state to state, there is nevertheless some regularity as 
between states in the way these values change from year to year. 
Finally, the evidence of the regression coefficients is that, although 
there is some variation in time even for this summary number, there is a 
strong tendency toward internal stability within each state, and more- 
over such changes in regression as occur are of the sort which might 
fairly be expected to accompany the response of the banking system to 
fluctuations in industry and trade.


MEASURES OF CORRELATION DETERMINED FROM
GROUPS OF VARYING HOMOGENEITY¹


By Truman L. Kelley, Stanford University


As is well known, a Pearson product-moment coefficient of correlation
is affected by three factors: (a) the inherent connection between
the true variables of which $X_1$ and $X_2$ are obtained measures, (b) the
range of the variables, and (c) the precision of the instruments of
measurement. In an objective sense we must of course state that the
raw correlation, $r_{12}$, between the actually available measures $X_1$ and
$X_2$ does measure the inherent connection between these variables, but
if $X_1$ and $X_2$ are very fallible measures of underlying true measures
$X_\infty$ and $X_\omega$, then commonly our theoretical interest is not in the
correlation $r_{12}$, but in that between the true measures, $r_{\infty\omega}$. Still
further, if it is known that $r_{\infty\omega}$ itself is a variable quantity depending
upon the variability of the populations sampled, our interest goes even
beyond this measure to a measure independent of the particular sample drawn.
Populations may be heterogeneous in a great many ways, but to make
the argument specific it may be well for the reader to think of two
populations of American white school children of the same race and
general social level, but differing in chronological age. It might also
be well to think of two measures of mental traits, these traits being
chosen because all readers will agree that any obtained measures $X_1$
and $X_2$ are far from perfect measures of the underlying true traits,
$X_\infty$ and $X_\omega$. The problem in terms of these specific data is to consider
the raw correlation $r_{12}$, and any other promising measures of
relationship, with a view to determining the influence upon them of (a) errors
of measurement and (b) range of talent examined.

¹ The new measures here given were developed in connection with a study made possible by a grant
from the Commonwealth Fund.

The unit of measurement involved in the raw correlation coefficient
is the standard deviation of the group, and a comparison of correlation
coefficients derived from different populations carries with it the
assumption that the standard deviations of the different populations
involved are equivalent: not numerically equal, but equal in their real
significance. With this in mind the purpose of this article may also
be stated: to derive a measure of relationship which does not make this
assumption. The measure derived later does make another assumption,
that the standard errors of measurement are the same for the
same variable in two different populations. This kind of an error
function has had wide use in statistics as a unit of measurement. Thus, if
we have a group variance,¹ $\sigma_1^2$, made up of two factors $\sigma_\infty^2$ and
$\sigma_{1\cdot\infty}^2$, the first the true variance not contaminated by errors in
measurement, and the second the variance due solely to the errors in
measurement, it should be clear that there are many situations when it
would be reasonable to assume that $\sigma_{1\cdot\infty}$ is a constant from population
to population, while $\sigma_\infty$ and accordingly $\sigma_1$ vary greatly.

In passing, it should be noted that if the error factor is small the use
of it as a denominator or unit of measurement has the disadvantage
inherent in a very small unit. If with the decrease in the size of this
error variance, there is no proportionate decrease in its probable error,
then this unit of measurement becomes unreliable. For these reasons
the measure of relationship given later and based upon the constancy
of this error function is not recommended if the errors of measurement
are very small or if the ratio $\sigma_1/\sigma_{1\cdot\infty}$ is large. What constitutes
being “large” may be difficult to determine experimentally, but
logically we can say that, in those situations in which the proportionate
error in the assumption that $\sigma_{1\cdot\infty} \equiv \Sigma_{1\cdot\infty}$ (the lower-case symbol refers
to a first population and the capital symbol to a second) is less than
that in the assumption $\sigma_1 \equiv \Sigma_1$, it will be preferable to use a measure of
relationship built upon $\sigma_{1\cdot\infty}$, rather than upon $\sigma_1$ as is the Pearson
product-moment correlation coefficient. The triple bar ($\equiv$) has been
used to mean “is equivalent to in terms of real significance,” for neither
with respect to $\sigma_1$ nor $\sigma_{1\cdot\infty}$ are we concerned with the actual numerical
values but only with their fundamental equivalence in meaning.

What constitutes “equivalence” may be made clear by an 
illustration: If weights are estimated by two judges by the effort 
required to lift them we will have two series of measures, let us say 
estimates in pounds by the first judge and estimates in kilograms by the 
second. If the actual series of weights lifted by the two judges have 
covered different intrinsic ranges, then $\sigma_1 \neq \Sigma_1$, nor will $\sigma_1 \equiv \Sigma_1$, for 
reducing both series to the same units, let us say pounds, will not 
produce equal standard deviations. In this situation it would be much 
safer to assume that our judges are equally excellent in estimating 
weights and that $\sigma_{1\cdot\infty} \equiv \Sigma_{1\cdot\infty}$, though numerically they would be wide 
apart, the first being the standard error of estimate in pounds and the 
second in kilograms. Now, finally, suppose judgments as mentioned 
have been made, but that we do not know that pounds and kilograms 
were the units employed in the minds of the judges. The lack of this 
information is the experimental fact that necessitates some assumption. 
The assumption universally made when product-moment correlation 
coefficients obtained from different groups have been compared is that 
$\sigma_1 \equiv \Sigma_1$. Obviously in the situation as pictured this is 
not as reasonable an assumption as that $\sigma_{1\cdot\infty} \equiv \Sigma_{1\cdot\infty}$. 


1 Following R. A. Fisher (“The Correlation Between Relatives on the Supposition of Mendelian 
Inheritance,” Transactions of the Royal Society, Edinburgh, 1918), the term variance is used to mean 
$\sigma^2$, the standard deviation squared. 

If the product-moment correlation coefficient be written in the form 

$$r_{12} = \frac{1}{N} \sum \left( \frac{x_1 x_2}{\sigma_1 \sigma_2} \right)$$

it is clear that the specific unit of measurement enters into the 
numerator and denominator to the same power and thus cancels out. We 
must not, however, conclude on this account that the measure is a 
pure abstract number and not one in terms of some sort of units. A 
pure abstract number does not exist in statistics. Without any 
proclamation of the fact, the mere division of $x_1$ by $\sigma_1$ has taken out the specific 
unit of measurement and put in the group variability as a unit and it 
stays in forever after unless some other process, perhaps an equally 
unobtrusive division, takes it out and puts in a still different unit. 
This introduction has been for the purpose of showing the futility 
of attempting to obtain a measure of relationship not in terms of 
units of some sort, and to emphasize that every time one unit is taken 
out, some other one is put in. It is now pertinent to define a number 
of symbols and give a number of well-known equations which will be 
used later. 

$X_1$ is the individual score in terms of the original units in trait 1. 

$x_1$ is the individual score as a deviation from the group mean in 
terms of the original units. 

$\sigma_1$ is the standard deviation of $X_1$, or $x_1$, scores. 

$X_I$ is a similar measure to $X_1$, that is, the same individual and the 
same trait is measured by an equally reliable instrument of 
measurement and the same units are employed. Thus in 
the long run $X_I$ differs from $X_1$ (and $x_I$ from $x_1$) only as the 
chance error in the measuring device causes them to be 
different. In the long run $\sigma_1 = \sigma_I$. 

$r_{1I}$ is the correlation between these two similar measures $X_1$ and 
$X_I$, and is called a reliability coefficient. 

$X_\infty$ is the individual’s true score in trait 1. It may also be defined 
as the average score of the individual if measured an infinite 
number of times under comparable conditions and with 
similar measuring devices. The standard deviation of such 
true scores is $\sigma_\infty$. 

$X_{1\cdot\infty}$ is that part of the $X_1$ score which is independent of $X_\infty$. It is 
therefore the chance element entering into the $X_1$ score and 
thus it equals $(X_1 - X_\infty)$. Being the chance element, $X_{1\cdot\infty}$ 
possesses the property of not correlating with any other 
measure, except $X_1$ of which it is a part. 

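$\sigma_{1\cdot\infty}$ is the standard deviation of such chance factors, and these being 
uncorrelated with $X_\infty$, we have 

$$\sigma_1^2 = \sigma_\infty^2 + \sigma_{1\cdot\infty}^2 \qquad [1]$$

Corresponding to first variable measures given in the first line below 
are second variable measures as given in the second line: 

$X_1,\ X_I,\ \sigma_1,\ \sigma_I,\ (\sigma_1 = \sigma_I),\ X_\infty,\ \sigma_\infty,\ X_{1\cdot\infty}$
$X_2,\ X_{II},\ \sigma_2,\ \sigma_{II},\ (\sigma_2 = \sigma_{II}),\ X_\omega,\ \sigma_\omega,\ X_{2\cdot\omega}$

$r_{12}$ is the correlation between $X_1$ and $X_2$, and in the long run 
$r_{12} = r_{1\,II} = r_{I\,2} = r_{I\,II}$. 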


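$r_{\infty\omega}$ is the correlation between $X_\infty$ and $X_\omega$. 

$$r_{\infty\omega} = \frac{r_{12}}{\sqrt{r_{1I}}\,\sqrt{r_{2II}}} \qquad \text{(Spearman Formula)} \quad [2]$$
(See Kelley, 1923, Statistical Method, Formula [155a]) 

$$\sigma_\infty = \sigma_1 \sqrt{r_{1I}} \qquad [3]$$
(See Kelley, 1923, Statistical Method, Formula [166]) 

$$\sigma_{1\cdot\infty} = \sigma_1 \sqrt{1 - r_{1I}} \qquad [4]$$
(See Kelley, 1923, Statistical Method, page 222) 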


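$X_{1\cdot\infty\omega}$ = that part of $X_1$ which is independent of $X_\infty$ and $X_\omega$. It is 
thus the chance factor in $X_1$ and accordingly $X_{1\cdot\infty\omega} = X_{1\cdot\infty}$ [5] 

Also, $\sigma_{1\cdot\infty\omega} = \sigma_{1\cdot\infty} = \sigma_1 \sqrt{1 - r_{1I}}$ [6] 
and for the second variable, 

$X_{2\cdot\infty\omega} = X_{2\cdot\omega}$ [5] 

$\sigma_{2\cdot\infty\omega} = \sigma_{2\cdot\omega} = \sigma_2 \sqrt{1 - r_{2II}}$ [6] 

$r_{12\cdot\infty\omega}$, being the correlation between $X_{1\cdot\infty\omega}$ and $X_{2\cdot\infty\omega}$, is the 
correlation between two chance factors and is therefore equal to 
zero. [7] 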

T.~ being the correlation between X;.. and X2... is the correlation 
between one chance factor and another variable and is there- 
fore equal to zero. [8] 

Similarly, ri, the correlation between X;.. and X>2.4 equals 
zero. [8] 
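
The relations [1] to [4] can be checked numerically. The following is a minimal sketch, not part of the original article: the function names and the values of `sigma1` and `r1I` in the first check are hypothetical, and the form used for [3] is the one implied by [1] and [4]. The correlations passed to `true_score_correlation` are those reported for the 8-year-old group in the numerical illustration later in the article, where the result is given to two decimals as .60.

```python
import math


def true_score_correlation(r12, r1I, r2II):
    """Equation [2]: correlation between the true scores of traits 1 and 2."""
    return r12 / (math.sqrt(r1I) * math.sqrt(r2II))


def true_sd(sigma1, r1I):
    """Equation [3]: standard deviation of the true scores."""
    return sigma1 * math.sqrt(r1I)


def error_sd(sigma1, r1I):
    """Equation [4]: standard deviation of the chance errors."""
    return sigma1 * math.sqrt(1.0 - r1I)


# Hypothetical inputs, chosen only for illustration.
sigma1, r1I = 10.0, 0.80
s_true = true_sd(sigma1, r1I)
s_error = error_sd(sigma1, r1I)

# Equation [1]: obtained variance = true variance + error variance.
assert math.isclose(s_true ** 2 + s_error ** 2, sigma1 ** 2)

# Correlations reported for the 8-year-old group in the article.
r_true = true_score_correlation(r12=0.49, r1I=0.77, r2II=0.88)
print(round(r_true, 3))
```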

If measures $X_1$ are equally reliable when used in each of two 
populations, we mean thereby that $\sigma_{1\cdot\infty}$ for the first population is equal to 
the same standard error of estimate, which we will designate $\Sigma_{1\cdot\infty}$, 
for the second population. If it is likewise true for the second set of 
measures used, then $\sigma_{2\cdot\omega} = \Sigma_{2\cdot\omega}$. The correlations and standard 
deviations corresponding to those for the first population given in the 
first line below are, for the second population, as given in the second 
line: 

$r_{12},\ r_{\infty\omega},\ \sigma_1,\ \sigma_2,\ \sigma_{1\cdot\infty},\ \sigma_{2\cdot\omega}$
$R_{12},\ R_{\infty\omega},\ \Sigma_1,\ \Sigma_2,\ \Sigma_{1\cdot\infty},\ \Sigma_{2\cdot\omega}$


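If the measures are equally reliable in the two populations these 
quantities are related to each other by the following equations: 

$$\sigma_{1\cdot\infty} = \Sigma_{1\cdot\infty} \quad \text{and} \quad \sigma_{2\cdot\omega} = \Sigma_{2\cdot\omega} \qquad \text{(by definition of equal reliability)} \quad [9]$$

$$\frac{\sigma_1}{\Sigma_1} = \frac{\sqrt{1 - R_{1I}}}{\sqrt{1 - r_{1I}}} \quad \text{or} \quad R_{1I} = 1 - \frac{\sigma_1^2}{\Sigma_1^2}\,(1 - r_{1I}) \qquad [10]$$
(See Kelley, Statistical Method, 1923, Formula [178]) 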


$$\frac{\sigma_\infty}{\Sigma_\infty} = \sqrt{\frac{r_{1I}\,(1 - R_{1I})}{R_{1I}\,(1 - r_{1I})}} \qquad [11]$$
(See Kelley, Statistical Method, 1923, Formula [179]) 
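
As a hedged illustration of equation [10], the sketch below computes the reliability coefficient to be expected in a second population from the reliability in the first and the two obtained standard deviations, and then confirms that the standard errors of measurement of [4] come out equal, as [9] requires. The function name and all numerical inputs are hypothetical and are not taken from the article.

```python
import math


def second_population_reliability(r1I, sigma1, Sigma1):
    """Equation [10]: R_1I = 1 - (sigma1 / Sigma1)^2 * (1 - r_1I)."""
    return 1.0 - (sigma1 / Sigma1) ** 2 * (1.0 - r1I)


# Hypothetical inputs: first-population reliability and the two obtained SDs.
r1I, sigma1, Sigma1 = 0.90, 10.0, 15.0
R1I = second_population_reliability(r1I, sigma1, Sigma1)

# Equation [4] applied to each population: the error SDs should agree ([9]).
error_first = sigma1 * math.sqrt(1.0 - r1I)
error_second = Sigma1 * math.sqrt(1.0 - R1I)
assert math.isclose(error_first, error_second)

print(round(R1I, 4))
```

The wider the range of the second population, the higher its reliability coefficient, although the error of measurement itself is unchanged.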


With this notation in mind we will proceed to derive a measure of 
relationship by first deriving one of disparity, or lack of relationship. 
The quantity $(x_1/\sigma_1 - x_2/\sigma_2)$¹ measures the lack of mutuality of position 
in the first and second traits. It, however, involves a doubly variable 
unit in that $\sigma_1$ and $\sigma_2$ change with the range of talent examined and, 
for a given constant population, change with a change in the reliability 
of the measure used. We will therefore take $(x_1/\sigma_\infty - x_2/\sigma_\omega)$ as our 
measure of disparity and call it $\Delta_{12}$. The units here, $\sigma_\infty$ and $\sigma_\omega$, 
change only with change in range of talent examined. The difference 
$\Delta_{12}$ is independent of the specific units employed, that is, it does not 
require that the specific units (inches, centimeters, and the like) 
employed be the same for each of two populations studied. It is 
dependent upon the true variabilities of the populations, that is, it assumes 
that $\sigma_\infty \equiv \Sigma_\infty$ and $\sigma_\omega \equiv \Sigma_\omega$. Very commonly this true variability 
changes from population to population, but if the measures $X_1$ are 
equally reliable in the two populations and also the measures $X_2$, the 
standard deviation of the chance errors in $\Delta_{12}$ will be equivalent for 










1 For a discussion of the properties of this difference see T. L. Kelley, ‘A New Method for Deter- 
mining the Significance of Differences in Intelligence and Achievement Scores,” Journal of Educational 
Psychology, Vol. XIV, No. 6, Sept. 1923; also K. J. Holzinger, ‘‘ Note on Professor Kelley’s Formula for 
Determining the Significance of Differences,” Journal of Educational Psychology, Vol. XVI, No. 1, Jan. 
1925; and also “ Reply”’ to this “ Note”’ in the same issue. 
both populations. The error in $\Delta_{12}$ is $\Delta_{12\cdot\infty\omega}$ and equals 
$(x_{1\cdot\infty\omega}/\sigma_\infty - x_{2\cdot\infty\omega}/\sigma_\omega)$, and $\sigma_{\Delta_{12\cdot\infty\omega}}$ is a unit which is constant for the two 
populations, thus $\sigma_{\Delta_{12\cdot\infty\omega}} \equiv \Sigma_{\Delta_{12\cdot\infty\omega}}$. Therefore $\Delta_{12}/\sigma_{\Delta_{12\cdot\infty\omega}}$ is a measure of 
lack of relationship which is independent of the specific units 
employed, independent of the variability of the populations sampled, and 
dependent upon the one constant feature which holds for the two 
populations: the constant error of estimate. 

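This ratio is an individual measure and for the group entire we 
immediately have $\sigma_{\Delta_{12}}/\sigma_{\Delta_{12\cdot\infty\omega}}$ as the similar group measure of 
disparity. Squaring we obtain $\sigma^2_{\Delta_{12}}/\sigma^2_{\Delta_{12\cdot\infty\omega}}$ which, noting that 

$$\sigma^2_{\Delta_{12}} = \sigma^2_{\Delta_{\infty\omega}} + \sigma^2_{\Delta_{12\cdot\infty\omega}}$$
(this equation obtained as was [1]), may be written 
$$\left(\sigma^2_{\Delta_{\infty\omega}} + \sigma^2_{\Delta_{12\cdot\infty\omega}}\right) / \sigma^2_{\Delta_{12\cdot\infty\omega}}$$
This varies from 1 to infinity. Subtracting 1 from this ratio to obtain 
a measure whose lower limit is zero, we have another measure of lack of 
mutuality which will be designated $i^2_{12}$. 
$$i^2_{12} = \sigma^2_{\Delta_{\infty\omega}} / \sigma^2_{\Delta_{12\cdot\infty\omega}} \qquad [12]$$
$$i_{12} = \sigma_{\Delta_{\infty\omega}} / \sigma_{\Delta_{12\cdot\infty\omega}} \qquad [13]$$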
The magnitude $i_{12}$ is the standard deviation of the true differences 
divided by the standard deviation of the chance differences. The 
direct comparison, by ratios or differences, of quantities $i^2_{12}$ for different 
populations seems the obvious and most reliable way in which to make 
comparisons. Another measure varying from zero to one may be easily 
obtained which is merely a function of $i_{12}$, as is $j_{12}$ of the following 
equation. All that is necessary is to take the sine of the angle whose 
tangent is $i_{12}$. Designating this by $j_{12}$ we have 
$$j_{12} = \sin(\arctan i_{12})$$
$$j_{12} = \sigma_{\Delta_{\infty\omega}} / \sigma_{\Delta_{12}} \qquad [14]$$
This is a measure of disparity and has, as this form of writing it clearly 
shows, a very definite physical meaning; $j_{12}$ is the ratio of the true 
standard deviation of differences to the obtained standard deviation of 
differences. The significance of this ratio is not as obvious as is that 
of $i_{12}$, for here numerator and denominator are dependent, whereas in 
$i_{12}$ they are entirely uncorrelated. 
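The determination of $i_{12}$ and $j_{12}$ from a knowledge of $r_{12}$, $r_{1I}$, $r_{2II}$ and $r_{\infty\omega}$ 
(given by equation [2]) is very simple. 

$$\sigma^2_{\Delta_{\infty\omega}} = \frac{1}{N}\sum\left(\frac{x_\infty}{\sigma_\infty} - \frac{x_\omega}{\sigma_\omega}\right)^2 = 2 - 2r_{\infty\omega}$$

$$\sigma^2_{\Delta_{12\cdot\infty\omega}} = \frac{1}{N}\sum\left(\frac{x_{1\cdot\infty\omega}}{\sigma_\infty} - \frac{x_{2\cdot\infty\omega}}{\sigma_\omega}\right)^2 = \frac{1 - r_{1I}}{r_{1I}} + \frac{1 - r_{2II}}{r_{2II}} = \frac{1}{r_{1I}} + \frac{1}{r_{2II}} - 2$$
Accordingly, 
$$i^2_{12} = \frac{2(1 - r_{\infty\omega})}{\dfrac{1}{r_{1I}} + \dfrac{1}{r_{2II}} - 2} \qquad [15]$$
and 
$$j^2_{12} = \frac{2(1 - r_{\infty\omega})}{\dfrac{1}{r_{1I}} + \dfrac{1}{r_{2II}} - 2r_{\infty\omega}} \qquad [16]$$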
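
Equations [15] and [16] translate directly into two short functions. The sketch below is an illustration only; the function names are hypothetical. The inputs are the correlations reported for the 8-year-old group in the numerical illustration later in the article, with the true-score correlation taken at its rounded value of .60 as the article does. The final check uses the relation between [14] and [13], namely that the sine of the angle whose tangent is $i_{12}$ has the square $i^2_{12}/(1 + i^2_{12})$.

```python
import math


def i_squared(r_true, r1I, r2II):
    """Equation [15]: true-difference variance over chance-difference variance."""
    return 2.0 * (1.0 - r_true) / (1.0 / r1I + 1.0 / r2II - 2.0)


def j_squared(r_true, r1I, r2II):
    """Equation [16]: true-difference variance over obtained-difference variance."""
    return 2.0 * (1.0 - r_true) / (1.0 / r1I + 1.0 / r2II - 2.0 * r_true)


# Values reported for the 8-year-old group (r_true rounded to .60 as in the article).
i_sq = i_squared(r_true=0.60, r1I=0.77, r2II=0.88)
j_sq = j_squared(r_true=0.60, r1I=0.77, r2II=0.88)

print(round(i_sq, 3), round(j_sq, 3))  # 1.839 0.648

# j = sin(arctan i) is equivalent to j^2 = i^2 / (1 + i^2).
assert math.isclose(j_sq, i_sq / (1.0 + i_sq))
assert math.isclose(math.sqrt(j_sq), math.sin(math.atan(math.sqrt(i_sq))))
```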


The formulas for $i_{12}$ and $j_{12}$ give the ratios of certain standard 
deviations under the specific conditions as to reliability found to maintain, 
for $r_{1I}$ and $r_{2II}$, as well as $r_{12}$, are experimentally determined correlation 
coefficients. If we desire an $i_{12}$ or a $j_{12}$ value for other conditions of 
reliability, all that is necessary is to substitute the desired values for 
$r_{1I}$ and $r_{2II}$ in equations [15] and [16], not, however, introducing these 
reliabilities into equation [2] which yields the value $r_{\infty\omega}$. The range 
remaining constant, $r_{\infty\omega}$ will not vary (except for random chance 
fluctuations) with change in the reliability of the measures used. 

The method of derivation informs us that for two populations, $i_{12} \equiv 
I_{12}$, providing the standard errors of measurement are essentially 
equivalent. If known not to be equivalent but to hold some ratio $\rho$ to 
each other, we may use the equation 








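$$\rho = \frac{\sigma_{1\cdot\infty}}{\Sigma_{1\cdot\infty}} = \frac{\sigma_1\sqrt{1 - r_{1I}}}{\Sigma_1\sqrt{1 - R_{1I}}} \qquad [17]$$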


from which, knowing $\rho$ and $r_{1I}$, we may determine $R_{1I}$. This provides 
us with the value that should be substituted in the equation for $I_{12}$ or 
that for $J_{12}$ to yield for the second population a result comparable to 
that for the first. Again, we may know or have warrant for assuming 
the following ratio $\rho'$: 


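$$\rho' = \frac{\sigma_{1\cdot\infty}/\sigma_\infty}{\Sigma_{1\cdot\infty}/\Sigma_\infty} = \frac{\sqrt{(1 - r_{1I})/r_{1I}}}{\sqrt{(1 - R_{1I})/R_{1I}}} \qquad [18]$$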











from which, knowing $r_{1I}$, we may determine $R_{1I}$. There are thus 
several kinds of experimental situations which may be interpreted by the 
use of $i_{12}$ or $j_{12}$. 
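From disparity measures $i_{12}$ and $j_{12}$ may be derived correlation 
measures $g_{12}$ and $h_{12}$ by taking the cotangent of an angle whose tangent 
is $i_{12}$ and the cosine of an angle whose sine is $j_{12}$ respectively, thus: 

$$g^2_{12} = \frac{1}{i^2_{12}} = \frac{\sigma^2_{\Delta_{12\cdot\infty\omega}}}{\sigma^2_{\Delta_{\infty\omega}}} = \frac{\dfrac{1}{r_{1I}} + \dfrac{1}{r_{2II}} - 2}{2(1 - r_{\infty\omega})} \qquad [19]$$

$$h^2_{12} = 1 - j^2_{12} = \frac{\sigma^2_{\Delta_{12\cdot\infty\omega}}}{\sigma^2_{\Delta_{12}}} = \frac{\dfrac{1}{r_{1I}} + \dfrac{1}{r_{2II}} - 2}{\dfrac{1}{r_{1I}} + \dfrac{1}{r_{2II}} - 2r_{\infty\omega}} \qquad [20]$$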
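
The trigonometric definitions of $g_{12}$ and $h_{12}$ can be checked against the closed forms [19] and [20]. This is a small sketch under stated assumptions: the function names are hypothetical, and the starting values of $i^2_{12}$ and $j^2_{12}$ are the rounded figures reported for the 8-year-old group in the numerical illustration of the article.

```python
import math


def g_from_i(i):
    """Cotangent of the angle whose tangent is i (equation [19])."""
    return 1.0 / math.tan(math.atan(i))


def h_from_j(j):
    """Cosine of the angle whose sine is j (equation [20])."""
    return math.cos(math.asin(j))


# Rounded values reported for the 8-year-old group.
i12 = math.sqrt(1.839)
j12 = math.sqrt(0.648)

g12 = g_from_i(i12)
h12 = h_from_j(j12)

# The trigonometric definitions reduce to g^2 = 1 / i^2 and h^2 = 1 - j^2.
assert math.isclose(g12 ** 2, 1.0 / 1.839)
assert math.isclose(h12 ** 2, 1.0 - 0.648)
```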


The merit of independence of range, found for $i_{12}$ and $j_{12}$, is of course 
likewise a merit of $g_{12}$ and $h_{12}$, but the writer finds it easier to think in 
terms of “differences” and “disparity” than in terms of “agreement” 
and “correlation,” and thus personally prefers $i_{12}$ to $g_{12}$ and $j_{12}$ to $h_{12}$. 

A numerical illustration drawn from a much larger body of data to 
be published shortly will now be given: A certain group of typical 8- 
year-old school children were tested with an Arithmetic Reasoning 
test and with a Spelling test, with the following results: 


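$N = 825$            $r_{\infty\omega} = .60$
$r_{12} = .49$       $i^2_{12} = 1.839$
$r_{1I} = .77$       $j^2_{12} = .648$
$r_{2II} = .88$
Similar constants determined from an 11-year-old group are: 
$N = 887$            $R_{\infty\omega} = .50$
$R_{12} = .43$       $I^2_{12} = 2.250$
$R_{1I} = .75$       $J^2_{12} = .692$
$R_{2II} = .90$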


The values $i^2_{12}$ and $I^2_{12}$ should be directly compared if it is known that 
the standard errors of estimate are equal; that is, if $\sigma_{1\cdot\infty} \equiv \Sigma_{1\cdot\infty}$ and 
$\sigma_{2\cdot\omega} \equiv \Sigma_{2\cdot\omega}$. In the situation in hand it seemed reasonable to deal with 
“sense” units,¹ that is, to consider for the 8-year-olds such a difference 
in ability as was sensed by a certain per cent of teachers to be 
equivalent to a difference in ability of 11-year-olds sensed by the same per 
cent of teachers. If units are so defined, then of course the standard 
errors of estimate are of necessity equal for the two populations. An 
experimental determination following this principle, not here further 
described, gave the ratio $\rho = 1.095$. We may accordingly choose $r_{1I}$ 


1 An argument in support of the “sensed distance”’ is given in T. L. Kelley, “Principles and Tech- 
nique of Mental Measurement,” American Journal of Psychology, Vol. 34, July 1923. 
and $r_{2II}$ at will and determine by equation [17] $R_{1I}$ and $R_{2II}$. Choosing 
$r_{1I} = r_{2II} = .85$ we obtain $R_{1I} = R_{2II} = .82$. Redetermining $i_{12}$, $j_{12}$, $I_{12}$ 
and $J_{12}$ using these values in equations [15] and [16] for the reliability 
coefficients we obtain: 


$i^2_{12} = 2.267$
$I^2_{12} = 2.278$


We may accordingly conclude that intrinsically the disparity be- 
tween Arithmetic Reasoning ability (as measured by the test em- 
ployed) and Spelling ability (as measured) is substantially the same at 
ages 8 and 11. In terms of $j^2_{12}$ we have: 


$j^2_{12} = .694$

$J^2_{12} = .695$
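
The four adjusted values just quoted can be recomputed from equations [15] and [16]. The sketch below is illustrative only and uses hypothetical function names. The reliabilities (.85 for the 8-year-olds, .82 for the 11-year-olds) and the true-score correlations (.60 and .50) are the figures stated in the article; only the reliabilities are changed, since the true-score correlation from equation [2] is left as found.

```python
def i_squared(r_true, r1I, r2II):
    """Equation [15]."""
    return 2.0 * (1.0 - r_true) / (1.0 / r1I + 1.0 / r2II - 2.0)


def j_squared(r_true, r1I, r2II):
    """Equation [16]."""
    return 2.0 * (1.0 - r_true) / (1.0 / r1I + 1.0 / r2II - 2.0 * r_true)


# 8-year-olds: chosen reliabilities .85, true-score correlation .60 as found.
i_sq_8 = i_squared(0.60, 0.85, 0.85)
j_sq_8 = j_squared(0.60, 0.85, 0.85)

# 11-year-olds: corresponding reliabilities .82, true-score correlation .50 as found.
i_sq_11 = i_squared(0.50, 0.82, 0.82)
j_sq_11 = j_squared(0.50, 0.82, 0.82)

print(round(i_sq_8, 3), round(i_sq_11, 3))  # 2.267 2.278
print(round(j_sq_8, 3), round(j_sq_11, 3))  # 0.694 0.695
```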

The same procedure was followed for a population of 982 14-year- 
olds. The results for the three groups are tabulated herewith. 


Comparable measures of disparity. Columns marked (a) use $r_{1I}$ and $r_{2II}$ as 
found; columns marked (b) use $r_{1I}$ and $r_{2II}$ as required to make the 
standard errors of estimate, in terms of sensed units, equal throughout 
the different ranges. 

Age | N   | $r_{12}$ | $r_{1I}$ | $r_{2II}$ | $r_{\infty\omega}$ | $i_{12}$ (a) | $j_{12}$ (a) | $i_{12}$ (b) | $j_{12}$ (b)
8   | 825 | .49 | .77 | .88 | .60 | 1.355 | .805 | 1.506 | .833
11  | 887 | .43 | .75 | .90 | .50 | 1.500 | .832 | 1.509 | .834
14  | 982 | .33 | .76 | .81 | .42 | 1.452 | .824 | 1.523 | .836











Unfortunately the probable errors of $i_{12}$ and $j_{12}$ are not as yet known. 
They are, however, undoubtedly not less than those of the $r_{12}$ values, 
so that it is probable that the $j_{12}$ values given in the last column lie 
within about 1 probable error of each other. This is not true of either 
the $r_{12}$ or the $r_{\infty\omega}$ values. Accordingly different conclusions would be 
drawn depending upon the measure of relationship used. It is here 
claimed that for the understanding of the inherent connection the $i_{12}$ 
or $j_{12}$ measure is the preferable device. The variability of our groups 
is not a factor in these measures, and the reliability of our measuring 
devices, though a factor, is not a variable factor, as equal errors of esti- 
mate (in terms of sensed units of ability) maintain throughout the 
three groups. In comparison with each other, therefore, these coef- 
ficients reveal the intrinsic intimacy of the two abilities in the sensed 
mental structures of the children examined at the three different ages. 

Summary: A comparison of two Pearson product-moment 
coefficients of correlation determined from different populations assumes 
an equivalence of standard deviations. Four coefficients, $g_{12}$, $h_{12}$, $i_{12}$ 
















and $j_{12}$, have been derived which do not make this assumption, but 
assume an equivalence between the standard deviations of the errors 
of measurement of the different groups. It is considered that this 
assumption is in many situations more valid than the former. 

In situations where the standard errors of estimate are not equal, 
but known to bear a certain ratio to each other, a simple adjustment of 
the $i_{12}$ (or the $g_{12}$, $h_{12}$, or $j_{12}$) values yields comparable results. Thus, 
of the three things affecting correlation, (a) inherent intimacy of the 
two functions, (b) range or variability of populations, and (c) reliability 
of instruments of measurement, a method is given for altogether 
eliminating (b) as a factor and making (c) a constant, thus yielding a 
measure of relationship which changes from population to population 
only as (a) changes. 
AN AUTOMATIC CORRELATION CALCULATING 
MACHINE¹ 


By Clark L. Hull, University of Wisconsin 


The machine described in the following pages was designed to elim- 
inate the drudgery and the persistent arithmetical errors from the 
calculation of the standard deviation and the Pearson product- 
moment coefficient of correlation. This has been accomplished by 
rendering the lengthy preliminary manipulation of the data quite auto- 
matic and mechanical, thus eliminating at once both the costliness and 
the fallibility of the human element. 

The machine also gives automatically direct and complete solutions 
of multiple regression equations and other prediction formulae involv- 
ing any number of variables. It thus makes aptitude and other esti- 
mates or forecasts quite mechanically and without the attention of an 
operator. Owing to the essential dissimilarity between the two func- 
tions of the machine, its use only in calculating the arithmetic mean, 
the standard deviation and the coefficient of correlation is discussed in 
the present article. The use of the machine for the automatic con- 
tinuous solution of multiple regression equations, especially for making 
wholesale multiple aptitude predictions for purposes of vocational 
guidance and selection is not described, though the machine is adapted 
to this further purpose.² 

Since the machine is based on Thurstone’s formula for the Pearson 
coefficient of correlation and on Ruml’s related formula for the standard 
deviation,³ the action of the machine will be best understood after a 


1 The writer is indebted to the University of Wisconsin for part of the fund with which the costly 
first model of the machine was built, and to the Committee on Scientific Problems of Human Migra- 
tion, National Research Council, for a considerably larger part. In addition the writer wishes to 
express his indebtedness to Mr. O. E. Romare, Chief Mechanician, University of Wisconsin, and to a 
number of skilful mechanics, notably Mr. Harold C. Kidder, for their splendid codperation and many 
valuable suggestions throughout the construction of the machine. 

The fundamentals of the design were first recorded in February, 1921. The working out of the 
complex problems of the design in detail required much thought and extended over a period of more 
than two years. In April of 1923 the construction of the preliminary model was begun. This model 
was exhibited at the Madison meeting of the American Psychological Association in December, 1923. 
Work was continued on the model during 1924. It was sufficiently perfected by the late summer of 
that year to do practical correlation work on a large scale, about 150 coefficients being computed by 
means of it. In December, 1924 the machine was demonstrated before the Washington meeting of 
the American Psychological Association, both in its capacity as an automatic correlation calculating 
machine and as an automatic solver of multiple regression equations. Since April of the present year 
over a thousand correlation coefficients and standard deviations have been computed with it. 

2 See “An Automatic Machine for Making Multiple Aptitude Forecasts,” Journal of Educational 
Psychology, (about) December, 1925. 

3 Thurstone, L. L., Psychological Bulletin 1917, Vol. 14, p. 28. 

Ruml, Beardsley, Psychological Bulletin 1916, Vol. 13, p. 444. 

It is to be noted that the formulae of Thurstone and Ruml do not yield mere approximations to r and 
σ respectively, but absolutely the same values as the older formulae. Though the arithmetic of the 
brief illustration of the use of the respective formulae on a miniature 
set of data. These data are shown in columns A and B of Table I. 











TABLE I 

         A     B    A²    B²   A×B 
         2     8     4    64    16 
         4     5    16    25    20 
         6     7    36    49    42 
        13     9   169    81   117 
        10    11   100   121   110 
Sums    35    40   325   340   305 




















Thurstone’s formula is:¹ 

$$r = \frac{N \sum (A \times B) - \sum A \times \sum B}{\sqrt{N \sum A^2 - (\sum A)^2}\;\sqrt{N \sum B^2 - (\sum B)^2}}$$








Ruml’s related formula for the standard deviation in the same notation 
is: 

$$\text{S.D.} = \frac{\sqrt{N \sum A^2 - (\sum A)^2}}{N}$$





In the above formulae, A and B represent two parallel columns of 
data (see Table I), N is the number of items in a column, $\sum A$ and $\sum B$ 
are the sums of columns A and B, $\sum A^2$ and $\sum B^2$ are the sums of the 
squares of the separate items in the columns A and B, and $\sum (A \times B)$ 
is the sum of the products of the N items in column A, each multiplied 
by the corresponding item in column B. Substituting the values of 
Table I in the two equations: 


$$r = \frac{5 \times 305 - 35 \times 40}{\sqrt{5 \times 325 - 35^2}\;\sqrt{5 \times 340 - 40^2}} = \frac{125}{20 \times 10} = .625$$














newer formulae is much simpler in most respects, the formulae themselves are exact mathematical 
equivalents of those in earlier use. 

1 If both the means and the σ’s are needed in addition to r, it is convenient to use the formula: 

$$r = \frac{M_{A \times B} - M_A \times M_B}{\sqrt{M_{A^2} - (M_A)^2}\;\sqrt{M_{B^2} - (M_B)^2}}$$
This formula and the one given above are both derivations from Thurstone’s original formula. Neither 
is in exactly the form given by Thurstone. 





$$\text{S.D.} = \frac{\sqrt{5 \times 325 - 35^2}}{5} = 4$$


It should be noted that once the radical in the denominator of the 
correlation formula has been worked out, the result needs only to be 
divided by N in order to yield the corresponding standard deviation. 
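
The worked example can be reproduced in a few lines. The sketch below is an illustration with hypothetical function names; it applies Thurstone's and Ruml's formulae, as stated above, to columns A and B of Table I and cross-checks the standard deviation against the population standard deviation from Python's `statistics` module.

```python
import math
import statistics

A = [2, 4, 6, 13, 10]
B = [8, 5, 7, 9, 11]


def radical(column):
    """sqrt(N * sum(x^2) - (sum x)^2), the term shared by both formulae."""
    n = len(column)
    return math.sqrt(n * sum(x * x for x in column) - sum(column) ** 2)


def thurstone_r(a, b):
    """Pearson r from raw sums, in the form given in the text."""
    n = len(a)
    numerator = n * sum(x * y for x, y in zip(a, b)) - sum(a) * sum(b)
    return numerator / (radical(a) * radical(b))


def ruml_sd(column):
    """Standard deviation: the radical divided by N."""
    return radical(column) / len(column)


r = thurstone_r(A, B)
sd_a = ruml_sd(A)

print(r, sd_a)  # 0.625 4.0

# Ruml's form is an exact equivalent of the usual population S.D.
assert math.isclose(sd_a, statistics.pstdev(A))
```

Because `radical` is computed once per column, the same intermediate value serves both the correlation and the standard deviation, which is the economy the text points out.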

An examination of the above formulae shows that only three kinds 
of values need to be computed before the final formula is ready for 
solution: 


I. $\sum A$, $\sum B$; 
II. $\sum A^2$, $\sum B^2$; 
III. $\sum (A \times B)$. 


It should be observed further that from a mechanical point of view all 
three types of values are practically identical since I and II may both 
be considered as product-sums as well as III. No. II is the sum of the 
products of the N items of column A multiplied by themselves, while 
I is the sum of the products of the items of column A each multiplied 
by 1. The machine, essentially an automatic products-sum calculator, 
is accordingly able to perform all three operations with practically the 
same mechanism. 
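
The observation that all three kinds of values are product-sums can be made concrete. In the minimal sketch below, a single hypothetical function `product_sum` stands in for the machine's one mechanism; the three operations differ only in which two columns are supplied. The data are columns A and B of Table I.

```python
def product_sum(multiplier, multiplicand):
    """Sum of pairwise products of two equally long columns."""
    if len(multiplier) != len(multiplicand):
        raise ValueError("columns must have the same number of items")
    return sum(k * l for k, l in zip(multiplier, multiplicand))


A = [2, 4, 6, 13, 10]
B = [8, 5, 7, 9, 11]
ones = [1] * len(A)

sum_a = product_sum(ones, A)      # operation I: each item multiplied by 1
sum_a_squared = product_sum(A, A) # operation II: each item multiplied by itself
sum_ab = product_sum(A, B)        # operation III: cross-multiplication

print(sum_a, sum_a_squared, sum_ab)  # 35 325 305
```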

The solution of the final formulae for r and σ with ordinary 
statistical aids is a relatively simple matter. Where many items are 
involved, the time-consuming drudgery, as well as the arithmetical 
errors, lie in the lengthy preliminary manipulation of the data. By 
the present method no special correlation tables or preliminary hand 
computation of any kind are required. The columns of original data 
are simply transferred to perforated paper data strips by means of a 
special auxiliary machine. This operation may be compared roughly 
to running the data off on an ordinary listing adding machine. These 
data strips are then fed through the electrically driven calculating 
machine in different combinations and the machine gives automatically 
and directly, one after the other, the values required by the formulae. 
The formulae are then ready for the final solution. 

The data strips referred to above are made from tough kraft paper 
.007 inches thick and 4 inches wide. This paper is perforated on the 
edges somewhat like moving picture film. These edge perforations of 
the paper receive the teeth of feed wheels on both the recorder and the 
calculator in turn, insuring precise movement of the paper through 
each. The operation of the recorder is somewhat like that of a 
typewriter except that instead of printing it makes very precisely located 
perforations. Numbers up to 999 may be recorded on the present 
model. Two strips of paper receive the record at the same time and by 
the same operation. These two records are exact duplicates. The use 
of the second or duplicate record will be explained later. The re- 
corder is provided with a Veeder counter which shows automatically 
the number of the item recorded at any given stage of the process, thus 
preventing the skipping or duplication of items in the transcription. 

A typical section of a record strip showing a variety of numbers is 
shown in Plate I. The upper part of this record has been marked off 
into rectangles and the digits represented by the various holes are in- 
dicated. Thus the first number is 9, the second is 24, the third 365, 
the fourth zero and so on. The hole marked “X” and found with the 
extreme left-hand digit of each number does not itself represent a 
number but controls the shifting of the machine from one number to 
the next. Twenty-four numbers occupy one foot of the tape. The 
successive columns of data to be correlated follow one after the other 
on the tape, a single blank space being interposed between each. 

Just as in operating a computing machine, there is on the recorder 
always the possibility of pressing a wrong key. On a listing adding 
machine like the Burroughs this may be discovered after the computa- 
tion has been performed. On non-listing machines, like the Monroe, 
such an error can only be discovered by repeating the operation. In 
the latter case, if this second performance shows a disagreement with 
the first result, the operation must be performed a third time to deter- 
mine which of the previous two discordant results is correct. In 
general, it is necessary to proof read the tapes on all listing machines, 
and to perform the computations at least twice on all non-listing ma- 
chines, to insure against errors from pressing wrong keys. In contrast 
to machines operated in either of these ways the present system proof 
reads the tape and makes any necessary corrections before the compu- 
tations begin, thus eliminating the necessity of repeating the compu- 
tation in question. Moreover, as will be shown presently, the proof 
reading of a single column of data may be all the proof reading neces- 
sary for from three or four up to sixty or seventy distinct arithmetical 
operations each involving that particular column. 

The proof reading and correcting are done very simply by the present 
system. On the recorder is a transparent number plate over which the 
paper record is drawn as it feeds through the apparatus. After a mass 
of data has been recorded the strip is merely run through the recorder 
a second time. As it passes over this plate the number corresponding 
[PLATE I. A sample strip of perforated tape (actual size). The first four numbers have been 
ruled off and the various digits indicated for purposes of explanation. The lower 
portion shows the normal appearance of the tape.] 


to each particular perforation may be seen distinctly through the 
perforation itself. The record is thus easily read and compared with 
the original exactly as is done with an adding machine slip. In case an 
error is found a small square of gummed parcel-wrapping tape is placed 
over the wrong perforation and the correct hole is punched by the 
recorder as in ordinary recording. This is much simpler even than 
correcting an error found on an adding machine tape which often 
involves considerable hand computation and usually a repetition of the 
entire operation. 

The computing machine itself is shown in Plate II. It is built into 
a steel table and occupies a space 26 inches by 32 inches square. It is 
driven by a } H. P. motor which receives current from the ordinary 
lighting circuit. The data strips are shown passing through the 
machine as in operation No. III, i. e., cross-multiplication. The 
multiplier strip is shown at (K), the multiplicand strip at (L). All that is 
necessary for the operator to do in this typical operation is to place the 
strip containing the numbers of column A in one position, the strip 
containing the numbers of column B in the other position and then 
press the starter (S). The machine then, without any further attention 
from the operator, automatically multiplies each pair of numbers (as in 
column V, Table I) one after the other continuously, adding up the 
products as it goes along. When the machine reaches the bottom of 
the columns, whatever their length, it stops automatically and the 
$\sum (A \times B)$ may be read from the dial (D) at the convenience of the 
operator. While the machine is doing the computation the operator 
is free to score test sheets, cut new records, solve formulae or whatever 
he may prefer. 

It is impossible in this place to give any very adequate notion of the 
detailed mechanism of the calculator, though a few fundamental 
principles may be mentioned. The machine, both at the multiplier 
(K) and multiplicand (L), is provided with steel fingers arranged in a 
pattern, one finger for each possible digit. These fingers periodically 
descend upon the perforated data strip described above. The particu- 
lar fingers, falling upon perforations, pass through and thus direct the 
action of the machine very much as the hands of an operator might 
do by operating keys. The remaining fingers ride up on the paper and 
remain inactive. Special automatic devices are provided for shifting 
the carriage during multiplication, for moving up the data strips to the 
next number when a multiplication has been completed, for handling 
zero numbers and for continuous multiplication by 1, by which is ac- 
complished simple addition. The squaring operation is performed 
simply by running the duplicate data strips through the machine in 
parallel, one duplicate through the multiplier and the other through 
the multiplicand. The action of the machine is purely mechanical, 
electricity being used only to drive the motor. 
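
The article does not describe the internal arithmetic of the calculator in detail. The sketch below is therefore only an assumption, modelled on the way mechanical desk calculators commonly multiply: each digit of the multiplier calls for that many additions of the multiplicand, with the carriage shifted one decimal place between digits. All names are hypothetical. It is meant to show how carriage shifting, zero handling and a running dial total could fit together, not how the machine was actually built.

```python
def multiply_by_shift_and_add(multiplier, multiplicand):
    """Multiply two non-negative integers by repeated addition and shifting."""
    total = 0
    shift = 0  # current carriage position (power of ten)
    while multiplier > 0:
        digit = multiplier % 10
        for _ in range(digit):  # a zero digit simply adds nothing
            total += multiplicand * 10 ** shift
        multiplier //= 10
        shift += 1              # shift the carriage for the next digit
    return total


def machine_product_sum(multiplier_strip, multiplicand_strip):
    """Accumulate the products of paired numbers, as on the dial (D)."""
    dial = 0
    for k, l in zip(multiplier_strip, multiplicand_strip):
        dial += multiply_by_shift_and_add(k, l)
    return dial


A = [2, 4, 6, 13, 10]
B = [8, 5, 7, 9, 11]

# Squaring: the two duplicate strips of the same column run in parallel.
print(machine_product_sum(A, A))  # 325
# Cross-multiplication of columns A and B.
print(machine_product_sum(A, B))  # 305
```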

The actual use of the machine may be described best in connection 
with a typical mental aptitude testing project. Suppose there are 96 
subjects, a complete set of criterion scores and 15 complete sets of 
[PLATE II. Upper view of correlation calculating machine. K and L are 
rolls of perforated tape being fed into the multiplier and the multiplicand 
respectively. The answers are read off from the dial D.] 
test scores, that is, 16 columns of data with 96 items per column. The full 
computation of such a set of data requires: 
16 means, 
16 standard deviations, 
120 correlation coefficients. 

The 16 columns of data are first transcribed in duplicate on the data 
strips, one column following the other on the strip with one blank 
space between columns. The duplicate strips will each be about 65 
feet long and are handled separately, each in a convenient roll. The 
duplicates are next run through the recorder simultaneously a second 
time and proof read, any errors being corrected as described above. 
Absolute accuracy of the perforated record should result. 

One of the two duplicate strips is now placed at the point (L) of the 
machine which is set to add. The starter is pressed. The machine 
now proceeds automatically to add the various items of the first 
column as the strip feeds through. This is done without any attention 
whatever from the operator after the initial start. At the end of four 
minutes the 96th item has been added and the machine stops itself. 
The sum is copied off. The dial is cleared by the revolution of a small 
crank. The starter is pressed again and the same operation is repeated 
for the second and all the remaining columns. The results necessary 
for the computation of the 16 means are now available. They have been 
produced with mechanical accuracy from proof-read data. 

The strip just used is now replaced in the machine at (L) exactly as 
before. Its duplicate is placed at (K). Both strips are at the beginning 
of column 1. The starter is pressed. The machine now proceeds auto- 
matically to square each number of the column in turn, adding up the 
squares as it goes along and without any attention whatever from the 
operator. At the conclusion of the addition of the 96th square (about 
12 minutes) the machine stops itself. The sum of the squares is copied 
off and the dial cleared. The starter is pressed again and the machine 
proceeds at once to compute the sum of the squares of the second and 
all the remaining columns. The results necessary for the computation 
of the 16 standard deviations are now available. They have been de- 
rived by purely mechanical processes from strictly accurate data and 
are consequently free from human error. 

The first strip is replaced once more at (L) and the second strip again 
at (K). Before replacing the second strip, however, the first column of 
data is torn off so this (K) strip rests on the machine at the beginning 
of the second column. The starter is pressed. The machine now 
automatically cross-multiplies in succession each number in column 
one by the corresponding number in column two, adding up the 96 
products as it goes along. As the last product is added the machine 
automatically stops. The operator copies off the sum of the products 
of column 1 by column 2, clears the dial and presses the starter. The 
machine proceeds at once to multiply column 2 by column 3, then 3 
by 4, then 4 by 5, and so on to column 15 by column 16, yielding fifteen 
cross-products in all. The first strip is now replaced at (L) exactly 
as before, at the beginning of column 1. The second strip is shorn of 
column two and replaced at (K) at the beginning of column three. 
The machine now automatically gives in succession at about 12-minute 
intervals the cross-products of 1 and 3, 2 and 4, 3 and 5, and so on to 
14 and 16. After this, column three is torn off the second strip, which 
then yields still another series of cross-products. Then column four is 
torn off, then five, then six, and so on until all the 120 possible combina- 
tions of the 16 columns have been made. The results necessary for the 
computation of the 120 correlation coefficients by the formula are now 
available. Once more these results have been produced with mechan- 
ical accuracy from strictly accurate data. 
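
The three passes described above (sums, sums of squares, sums of cross-products) can be sketched in modern terms as follows. This is only an illustrative sketch: the function names are hypothetical, "the formula" is assumed to be the usual raw-score product-moment formula, and the standard deviation is assumed to be computed with N in the denominator. The pair ordering mimics the tearing-off of columns: adjacent columns first, then columns two apart, and so on.

```python
import math

def lag_schedule(n_columns):
    """Column pairs in the order the strips yield them: lag 1, then lag 2, and so on."""
    return [(first, first + lag)
            for lag in range(1, n_columns)
            for first in range(1, n_columns - lag + 1)]

def product_moment(n, sum_x, sum_y, sum_xx, sum_yy, sum_xy):
    """Correlation coefficient from raw sums (standard raw-score formula, assumed here)."""
    numerator = n * sum_xy - sum_x * sum_y
    denominator = math.sqrt((n * sum_xx - sum_x ** 2) * (n * sum_yy - sum_y ** 2))
    return numerator / denominator

def summarize(columns):
    """Means, standard deviations and all pairwise correlations from the three kinds of sums."""
    n = len(columns[0])
    sums = [sum(col) for col in columns]                      # machine set to add
    squares = [sum(v * v for v in col) for col in columns]    # duplicate strips at K and L
    means = [s / n for s in sums]
    sds = [math.sqrt(q / n - m * m) for q, m in zip(squares, means)]
    correlations = {}
    for i, j in lag_schedule(len(columns)):
        x, y = columns[i - 1], columns[j - 1]
        sum_xy = sum(a * b for a, b in zip(x, y))             # offset strips at K and L
        correlations[(i, j)] = product_moment(
            n, sums[i - 1], sums[j - 1], squares[i - 1], squares[j - 1], sum_xy)
    return means, sds, correlations

print(len(lag_schedule(16)))   # 120
# Tiny made-up data set: three columns of four scores each.
means, sds, r = summarize([[1, 2, 3, 4], [2, 4, 6, 8], [4, 3, 2, 1]])
print(means, r[(1, 2)], r[(1, 3)])
```

With 16 columns the schedule yields 15 pairs at the first setting, 14 at the second, and so on down to one, which accounts for the 120 coefficients.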

It will be observed that the formulae used by the present machine 
give the absolute values of means, standard deviations, and correlation 
coefficients. This is in contrast to the results obtained by the various 
short methods so widely used which, either through the grouping of 
data or the use of small numbers in place of large ones, never yield the 
correct values, except by chance, but only approximations. 

It must be emphasized also that while the machine is doing its 
various calculations, such as adding, squaring, and cross-multiplying, 
the attention of the operator is not required by the machine at all. 
He is quite free to do anything he may desire. For example, the 
writer on several occasions has started the machine on a long and diffi- 
cult column of computations, locked the laboratory and gone out to 
lunch. Upon his return the computation was found completed and 
the answer on the dial ready to be copied off, the machine having 
stopped itself at the conclusion of the operation. For this reason the 
amount of the operator’s time required in cutting the tapes, proof read- 
ing them and in actually serving the machine is surprisingly small. In 
a case like that cited above involving 16 variables, and counting the 
time required for adding and squaring as well as for the cross multi- 
plications, the operator would average only about five minutes of 
actual work in the preliminary preparation of the data for the final 
formulae, for each of the 120 correlation coefficients. This means that 
in large blocks of correlation coefficients the machine eliminates nearly 
all the labor except the solution of the final formula. 
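
As a rough tally of the figures quoted for the 16-variable project, and assuming every run takes exactly the stated time (about 4 minutes per column for addition, about 12 minutes per column for squaring and for each cross-product):

```latex
\begin{aligned}
\text{machine time} &\approx 16(4) + 16(12) + 120(12) = 64 + 192 + 1440 = 1696 \text{ minutes} \approx 28 \text{ hours},\\
\text{operator time} &\approx 120 \times 5 = 600 \text{ minutes} = 10 \text{ hours}.
\end{aligned}
```

On these assumptions the operator's share, which includes cutting and proof reading the tapes, is about 10 hours against about 28 hours of unattended machine running.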

Possibly of even greater importance, the machine eliminates almost 
absolutely the element of human error which has so long vexed the 
computer of the product-moment coefficient of correlation. 

Two of the correlation machines have been built. The first one was 
for the Wisconsin psychological laboratory. The second one, a some- 
what improved model, was built for the National Research Council. 
Judging by the cost of constructing these two models, the computer, 
together with the two auxiliary machines, can be duplicated for about 
$1200.00. 

Owing to the fact that many institutions needing correlation work 
done would not have enough work to make the purchase of a machine 
desirable, it has been suggested that a central correlation bureau be 
established. It seems likely that such a bureau provided with one or 
two of the machines described above, together with other modern 
statistical aids and a specially trained personnel, could do a good share 
of the correlation work of the United States and with a promptitude, 
economy and accuracy previously unknown. Without any particular 
publicity a considerable amount of data has already been sent in, to be 
run on the first model built. If the demand for such a service as 
sketched above continues to increase, there will be established at Mad- 
ison in the near future a national correlation bureau operating on a 
regular commercial basis. 


AN APPARATUS TO ASSIST IN THE CALCULATION OF 
THE STANDARD DEVIATION OF A GROUPED 
FREQUENCY DISTRIBUTION¹ 


By Frederick E. Croxton, Columbia University 


The device herein described shortens the process of calculating the 
standard deviation of a grouped frequency distribution. It eliminates 
the necessity of calculating for each individual distribution, (1) the 
products of the frequencies of the groups and their deviations from an 
assumed mean and (2) the products of the frequencies and the squares 
of the deviations. These products are calculated once and for all 
around an assumed mean taken as the mid-point of the first group of 
the distribution. Due to this choice of the assumed mean all values 
handled are positive. Two forms of the apparatus will be described. 

First Form.—Each class of the frequency distribution is represented 
by a pile of slips of cardboard or light metal bearing the values of 
f, fd, and fd² for the various possible frequencies. The slips measure 
approximately ½ inch by 2 inches. Each pile of slips is fastened to 
a common baseboard by an inverted “U” shaped rod or staple passing 
through a hole in the left end of each slip, thus making it possible to 
expose the face of any slip. Figure I shows a side view of one of the 
piles. The piles of slips are arranged vertically. This may be seen 
in Figure II, which shows only five piles. For most work twenty to 
twenty-five piles will be necessary. 

Taking the assumed mean at the mid-point of the first class interval 
the entries on the slips for f, fd, and fd² appear: 


                      PILE I 
         (First class interval, deviation 0) 

                          f      fd     fd² 
Slip I..........          0       0       0 
Slip II.........          1       0       0 
Slip III........          2       0       0 
Slip IV.........          3       0       0 
   *       *       *       *       * 
Slip XCIX.......         98       0       0 
Slip C..........         99       0       0 

1 These designs represent improvements upon a device used by Prof. Herbert A. Toops of Ohio 
State University. 


                      PILE II 
         (Second class interval, deviation 1) 

                          f      fd     fd² 
Slip I..........          0       0       0 
Slip II.........          1       1       1 
Slip III........          2       2       2 
Slip IV.........          3       3       3 
   *       *       *       *       * 
Slip XCIX.......         98      98      98 
Slip C..........         99      99      99 


                      PILE III 
         (Third class interval, deviation 2) 

                          f      fd     fd² 
Slip I..........          0       0       0 
Slip II.........          1       2       4 
Slip III........          2       4       8 
Slip IV.........          3       6      12 
   *       *       *       *       * 
Slip XCIX.......         98     196     392 
Slip C..........         99     198     396 


                      PILE XX 
         (Twentieth class interval, deviation 19) 

                          f      fd     fd² 
Slip I..........          0       0       0 
Slip II.........          1      19     361 
Slip III........          2      38     722 
Slip IV.........          3      57    1083 
   *       *       *       *       * 
Slip XCIX.......         98    1862   35378 
Slip C..........         99    1881   35739 


The vertical arrangement of the piles makes it possible to add quickly 
the f, fd, and fd² columns. These summations are then substituted 
in the formula 

$$\sqrt{\frac{\sum fd^2}{N} - \left(\frac{\sum fd}{N}\right)^2}.$$ 


The general type of data handled will determine the number of 
slips to a pile. If the maximum frequency represented on the slips of 
each pile is 99, problems involving frequencies of any size may be 
handled by determining the sums of the columns first for the tens and 
units, then for the hundreds and thousands, and then if necessary for 
the tens of thousands and hundreds of thousands. 


FIGURE II 
The complete apparatus contains twenty-five piles of slips. 


ILLUSTRATION OF CALCULATION 

Classes                          f 
First class...............       2 
Second class..............       7 
Third class...............      13 
Fourth class..............       9 
Fifth class...............       3 
                               --- 
Total.....................      34 

To determine the standard deviation of the above series only the 
first five of the piles on the apparatus would be used. On the first 
pile two slips would be turned back exposing the third slip marked 
2, 0, 0 (see Figure II); on the second pile seven slips would be turned 
back; thirteen on the third; nine on the fourth; and three on the fifth. 
Adding the exposed columns and substituting in the formula gives 

$$\sqrt{\frac{188}{34} - \left(\frac{72}{34}\right)^2}$$ 

with a resulting value of 1.02 for the standard deviation 
in terms of class intervals. 
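
The look-up performed by the slips, including the handling of frequencies above 99 in two-digit portions, can be sketched as below. The function names are hypothetical and the sketch is only one way of expressing the procedure: it splits each frequency class by class, whereas the apparatus would be set once for the tens and units of every class and then again for the higher digits, which gives the same sums.

```python
import math

def slip(deviation, f):
    """Entries on one slip: f, fd and fd^2 for a class at the given deviation."""
    return f, f * deviation, f * deviation ** 2

def column_sums(frequencies):
    """Sum the f, fd and fd^2 columns; pile k (counting from 0) has deviation k.

    A frequency above 99 is read off in two-digit portions (tens and units,
    then hundreds and thousands, ...), each portion scaled by its place value.
    """
    sum_f = sum_fd = sum_fd2 = 0
    for deviation, frequency in enumerate(frequencies):
        place = 1
        while frequency > 0:
            frequency, portion = divmod(frequency, 100)  # portion is 0..99
            f, fd, fd2 = slip(deviation, portion)
            sum_f += f * place
            sum_fd += fd * place
            sum_fd2 += fd2 * place
            place *= 100
    return sum_f, sum_fd, sum_fd2

def standard_deviation(frequencies):
    """Standard deviation in class-interval units from the three column sums."""
    n, sum_fd, sum_fd2 = column_sums(frequencies)
    return math.sqrt(sum_fd2 / n - (sum_fd / n) ** 2)

illustration = [2, 7, 13, 9, 3]
print(column_sums(illustration))                    # (34, 72, 188)
print(round(standard_deviation(illustration), 2))   # 1.02
```

Because each of the three sums is linear in the frequencies, reading a large frequency in portions and scaling by 100 per step reproduces the sums that a single slip bearing the full frequency would give.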

Second Form.—This design differs from the first described in that 
the piles of slips are replaced by a series of discs, the data being placed 
radially on them so that any frequency for each group may be brought 
into line by rotating the discs. Figure III shows how the device would 
look when set up for the illustrative problem used above. This device 
can be operated faster than the one first described but in order to place 
sufficient frequencies on each disc and to provide for twenty or twenty- 
five groups it may be necessary to make the vertical dimension so great 
as to result in an unwieldy tool. 


AN INDEX NUMBER OF NET FOREIGN TRADE IN 
FOODSTUFFS 


By George B. L. Arner, United States Department of Agriculture¹ 


For the past sixty years at least, the United States has been one of the 
world’s greatest areas of food surplus. Particularly in the last quarter 
of the nineteenth century, exports of food products originating on the 
new farm lands of the Mississippi Valley assumed enormous propor- 
tions and with cotton formed by far the largest “favorable” items in 
the balance of our international trade. 

But the surplus of foodstuffs cannot be measured by considering only 
gross exports, for there are imports as well as exports. While we export 
apples, we import bananas. We export wheat and import sugar. 
Furthermore, there are imports as well as exports of nearly every food 
product. While wheat and its products are usually the largest items 
in our exports of foodstuffs, millions of bushels of wheat are usually 
imported from Canada. Imports of dairy products are usually ap- 
proximately equal to exports. 

The index number of net foreign trade, here presented, is an attempt 
to measure the fluctuations from year to year in the effective surplus of 
foodstuffs. This index number is of the simple aggregate type using as 
a base period the five years ending June 30, 1914. In order to measure 
the changes in volume of trade, the net volume of exports or imports of 
each commodity in each year is multiplied by the average unit export 
or import price for the base period taken as a weight. The average 
export price was used as a weight throughout where exports generally 
predominated over imports and the import price where imports gener- 
ally predominated over exports. In computing the aggregates, net 
exports were taken as positive and net imports as negative, the aggre- 
gate being the algebraic sum of the products. The base used was the 
average of the aggregates for the five base years ending June 30, 1914. 
The formula for this index number may be expressed as follows: 

$$\frac{\sum \pm Q_1 P_0}{\sum \pm Q_0 P_0}$$ 

Here $-Q_1$ = net imports and $+Q_1$ = net exports by volume in the 
given year; $-Q_0$ = average net imports and $+Q_0$ = net exports by 
volume in the base years; and $P_0$ = the average unit export or import 
price in the base years. 

1 The computations and valuable suggestions in the preparation of this article were made by Mr. 
Luis J. Borja of the Department of Agriculture. 
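
A minimal sketch of this aggregate index follows. The commodity names, quantities and prices are hypothetical figures chosen for illustration and are not taken from Table I; the function names are likewise assumptions. Net imports enter as negative quantities, so the aggregate is the algebraic sum described above, and the index is returned as a plain ratio (multiply by 100 if the base period is to read 100).

```python
def aggregate(net_quantities, base_prices):
    """Algebraic sum of net quantity times base-period price (net imports negative)."""
    return sum(net_quantities[name] * base_prices[name] for name in base_prices)

def index_number(given_year, base_period, base_prices):
    """Given-year aggregate divided by the base-period aggregate."""
    return aggregate(given_year, base_prices) / aggregate(base_period, base_prices)

# Hypothetical figures for illustration only.
base_prices = {"wheat": 1.00, "sugar": 0.03}                     # dollars per unit
base_period = {"wheat": 100_000_000, "sugar": -1_000_000_000}    # average net volume
given_year = {"wheat": 150_000_000, "sugar": -1_500_000_000}     # net volume

print(aggregate(base_period, base_prices))   # about 70 million dollars
print(index_number(given_year, base_period, base_prices))   # about 1.5
```

Because both aggregates are valued at the same base-period prices, the ratio responds only to changes in net volume, which is the stated purpose of the weighting.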

The number of commodities included in the computation was 57, in- 
cluding all the important grains and grain products, cattle, beef and 
pork products, dairy products, vegetable oils and oil materials, sugar, 
fruits and vegetables. On account of changes in classification of ex- 
ports and imports over a period of 45 years, some adjustments were 
necessary, particularly in the earlier years, to make the aggregates 
comparable. For example, certain pork products which were sepa- 
rately reported during the base years were combined in earlier years, 
making it necessary to weight the combined figures by an approximate 
export price determined by the relative importance of the commodities 
in the first year for which they were separately reported. Other minor 
commodities in earlier years were reported in value only. For these 
products it was necessary to estimate the quantity on the basis of the 
unit price prevailing in the first years for which both quantity and 
value were reported, and multiply this estimated quantity by the unit 
price in the base period. It was not considered advisable to carry the 
index number back of 1880 because the increasing number of adjust- 
ments would make the resulting index number unreliable. Since 1880, 
however, all of the more important commodities have been reported 
both in quantity and value, and reference to Table I will show how 
heavily the index number is necessarily weighted by such major export 
commodities as wheat, pork products and cattle, and on the import side 
by sugar. 

Another type of adjustment was necessary under the item of cattle, 
which in the first half of the period covered constituted one of the most 
important export items. Cattle exports have usually been either of 
heavy cattle for slaughter or breeding stock, while cattle imports have 
usually been of young feeder stock from Canada and Mexico. It 
would obviously be inaccurate to treat imported cattle, head for head, 
as equivalent to exported cattle. Adjustment was therefore made on 
the basis of relative unit prices of exported and imported cattle each 
year. 

The index number as shown in Table II was computed not only for 
all foodstuffs but also for net exports of animal products and for grains 
and grain products, and for net imports of sugar and for the miscellane- 
ous group composed chiefly of fruits, nuts, vegetables, vegetable oils 
and oil material. For sugar and the miscellaneous group, the aggre- 
gates have been negative, or on the import side, during the whole period 
of computation, giving an index number of net imports, which, while 
mathematically positive, is negative in its effect on the resultant index 

TABLE I 
BASIC DATA FOR INDEX NUMBER OF NET FOREIGN TRADE IN FOODSTUFFS 
(Net Imports −) 

s= 115, 754,964 

Group and commodity : Unit : Q0 (average net exports or net imports in the base period) : P0 (unit price in base period, dollars) : Q0 × P0 (base aggregate) : Q1 (net exports or net imports, 1923-24) : Q1 × P0 (aggregate, 1923-24) 
CN re : ’ * 112,156,552 * ’ 148,967,660 
Animal produscts,.......ccecses! ‘ : #3 130,109,674 * 8 229,655,648 
Grain and grain products,.....' ‘ ’ * 129,106,552 * ‘ 174,806,208 
ecccccecececcecccccccccce! ; ’ ’ 105,289,606 ’ $ 177,626,998 
Miscellaneous(fraits, nuts and! ‘ s s 8 8 
vegetables).....! 8 J § = 41,7 s _ 
th. secuimesisueen || 1,621,521 * 257 * 431,653 * = 25,566,720 ' = 5,585,515 
ai i iaditteeiialeaiaintiaen " t= 44,185,601 * 2179 * = 7,908,900 * = 62,496,028 * ~ = 11,167,147 
Ltilk(condgnsed, evaporated and! 8 id ad ® : 

POWHOFOd) .,....cecceceses? ® ’ 3,754,481 3 081 304,115 : 99,148,516 * 6,031,030 
Eggs in shells,...........0.e0*DOSEN 4 12,020,720 + 2225 8 2,479,662 * $2,405,621 : 7,291,265 
Yolks of and frozen eggs......:Pounds = 756,364 + 21273 = 96,058 : = 24,220,808 5 = 5,076,045 
BOOT, CONNCA....eccecesesscreed ® ’ 9,192,122 1 2118 : 1,064,670 + 1,591,404 + 167,766 
BOO, PLOKLOA,..ccceeecccesseet ™ ts $2,609,765 084 + 2,756,020 s 21,650,961 + 1,635,462 
itty dl cuscsieswanaccoedt  o t= 7,616,717 3 2105 1 = 820,965 1 = 22,162,620 = 2,329,175 
GOED Panic ncscceecseccace eestNumber 3 20,150 + 62,302 + 1,656,759 + = 96,696 t = 7,974,755 
POFK, CANDO... ccccccccccesestPOUMg 8 4,227,086 1 91273 536,840 + 2,724,916 3 346,065 
SM icicdsmnmmnecs! © 1 48,274,929 * 2105 * 5,068,866 | 37,469,599 * 3,954,287 
FO, WEMcccccedoccecscccece® ® s 925,335 * 1m 114,4% * 75,400,700 * 9,349,687 
Pe atateemrersnotonerveed e 8 5,642,411 * 2136 ! 767,368 * 11,920, 1,621,163 
Smeage casings......... ssocas® * 25,802,717 * 2187 ! 5,534,972 * 7,041, 964,630 
a re dtm[~~/:* © * 161,593,514 * 2127 23,062,376 * 408,099,391 * 51,628,623 
Haus and choulders............! « * 166,613,1% * 2130 * 21,685,707 * $69,456,550 * 48,029,612 
EE TE 2 474,351,2% 0110 * 52,178,636 *1,014,698,368 * 111,658,623 
Animal fate md Geiss” 5 249,431,868 * 2093 * 25,197,164 * 162,607,464 * 15,122,4% 
yutton and lamb,....... eS 8 713,225 * 2107 * 76,515 * = 1,619,491 -« 194,686 
a Seen Fe # 35,120,794 * 080 * 2,809,664 * 30,410,539 * 2,432,827 
Apples, fresh,......ceccscscee Barrels * 1,538,972 * 3,655 * 5,624,943 * 4,054,640 * 14,619,709 
Apricots, Gried.........00- «es Pounds * 19,438,006 * 109 * 2,118,745 * 36,776,678 ° 4,226,656 
Feasins, GFA. .cccccccccccs oot On s 6,481,963 3 .072 8 394,701 4 12,974,647 * 934,175 
PIMA ccccccccccceccescocceses? ® s 60,427,650 3 2059 3 4,745,251 * 136,448,485 * 6,050,461 
GREED ons ccceccesecesseee soos BINS s 1,114,378 * 2,533 % 2,822,719 * 2,588,668 * 6,557,083 

LOMODS , eeccccssccecsseesceceestPOUDGS  %=164,162,121 * 2025 ' = 5,775,729 * = 55,924,856 * - 1,266,271 
Bananas,..... eteonees eoeceess*Bunches %= 41,915,118 * 2326 4 = 13,664,527 * = 42,597,148 * = 13,886,670 
COPTBcccccccccccccccccccccccce* FOS Ye 45,061,788 * 2048 * = 2,162,965 * —341,034,514 * = 16,369,657 
SES Bice csnesicees eS Fy t= 655,775,861 § 2062 * = 4,409,621 * -159,967,534 * =15, 117,338 
Olive oil, edidle,.......se.e. ‘Gallons *= 4,671,659 * 1,300 * = 6,535,131 * = 60,794,224 * «= 135,977,401 
ncasscccesccescevescces™ © te = 339,669 * 04435 ' = 1,922,473 ' = 6,600,980 * = 5,012,8% 
a, ne ..-'Pounds ‘4= 17,614,962 * 2056 *§ = 986,458 * =~ 44,555,092 * = 2,494,975 
AIMONAB .gcccccccccccccccccccce, ® S— 16,925,352 * 2201 ' = $,401,996 * = 25,672,870 * = 6,200,447 
POMS. ccccccesecescccecsecce’ @ t= 16,442,735 * 041 ' - 756,152 * = 46,075,430 = 1,689,012 
Pe Mvtmsscmensanencn © *- 11,615,385 * 2078 * = 905,644 + = 21,089,505 * = 1,644,966 
SE iiticiatcnsssnenses “a @ *— 31,916,358 * 2042 ' = 1,360,467 ' = 16,746,561 * = 705,348 
indcvcsscescasevesesescses? © t= 25,462,620 3 2020 t = 509,256 t = 39,069,022 = 761,380 

' © t= 55,066,746 + e118 § = 35,904,256 * = 56,224,705 ' = 4,274,515 

” t= 18,703,745 3 .049 t= 916,465 * = 30,980,922 = 1,516,065 

" # 14,705,871 # .059 3 867,528 62,605,895 * 4,873,748 

FOCAIEB . coccccccccece CCITT ipuaneae t- 1,917,661 * 608 * = 1,549,648 * 2,512,262 § 2,029,908 
BOTLEY s+ .ereeerereevereeeeeere BUSMDLS 8 7,895,520 * 645 * 5,076,619 * 11,206,733 * 7,207,215 
Ba ciancnccececveccesoesneses” * 37,145,736 * 63 * 23,550,397 ' 20,959,294 * 15,268,192 
oo agmnnnonnnannniens : 399,675 ; 8,878 + 1,350,776 3 487,214 1,645,809 
cccccescoscccccctee 8 5,144,499 + 2403 1,267,253 1 = $,069,911s = 1,257,17% 


TABLE II 
INDEX NUMBERS OF NET FOREIGN TRADE IN FOODSTUFFS, 1880-1924 


number of exports of all foodstuffs, and is therefore plotted on an in- 
verted scale. It will be noted that for all foodstuffs the trend is gen- 
erally upward to a peak in 1898 with a rapid decline from 1906 to the 
outbreak of the world war. Following the war the peak of 1919, 
practically equal to that of 1898, has been followed by a decline to a 
comparatively low point in 1924. 

The animal products group shows pre-war peaks in 1899 and 1906 
with a post-war peak in 1919. The net exports of animal products 
have shown less fluctuation and have been better maintained in the 
periods of declining exports than have the exports of the grain group, 
which had its pre-war peak in 1898 and its post-war peak in 1922. 
Sugar and the miscellaneous group both show very definite upward 
trends in net imports during the whole period covered, and this increase 
in imports has, of course, the effect of lowering the index number of 
surplus of all foodstuffs. 

The construction of other group index numbers was considered, but 
it was decided that index numbers of this type with a low base figure 
would fluctuate too widely to be of value. An index number of dairy 
products, for example, would have a base aggregate of about −40,000,000, 
while for 1880 the aggregate would be nearly +40,000,000 and for 
1919 about +67,000,000, falling again to −8,000,000 in 1924. It is 
obvious that an index number based on these aggregates would be con- 
fusing and valueless. 
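
The difficulty can be seen by forming the ratios directly from the rounded aggregates quoted above (in millions), taking the index as the given-year aggregate divided by the base aggregate and expressing it as a plain ratio:

```latex
\frac{+40}{-40} = -1.00 \ (1880), \qquad
\frac{+67}{-40} \approx -1.68 \ (1919), \qquad
\frac{-8}{-40} = +0.20 \ (1924).
```

With a negative base, years of net export give a negative index and a year of small net import gives a positive one, so neither the sign nor the size of the index follows the surplus in any readable way.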

The index number of net foreign trade and the several group index 
numbers are presented graphically in the attached chart. This graph 
shows how definitely the end of the rapid agricultural expansion of the 
last quarter of the nineteenth century can be dated, and how rapid was 
the decline in our net exports with the industrial expansion of the ten 
years before the world war. Following the war-time boom in export 
trade, a distinct decline is again evident. The 1924-25 index number 
will, of course, be much higher than that of 1923-24 on account of the 
heavy wheat exports in the year just closed, but from the present out- 
look for grain and pork products, the index number for 1925-26 may 
fall below the base figure and approach the low mark of 1913-14. 


THE ERROR IN ELIMINATING SECULAR TREND AND 
SEASONAL VARIATION BEFORE CORRELATING 
TIME SERIES 


By Bradford B. Smith, United States Bureau of Agricultural Economics 


The practice of eliminating secular trend and seasonal variation can 
be justified only upon the assumption that the influence of numerous 
factors which determine these movements in a time series may be ex- 
pressed as functions of some numerical description of the passage of 
time. The actual passage of the years and the recurrence of the sea- 
sons, in themselves, can have no influence upon building contracts, for 
example; rather it is the growth of population, the rainfall and sun- 
shine brought by the seasons and countless other factors which origi- 
nate the trend and seasonal movements. It is merely a statistical con- 
venience to group all these factors together and predicate that their 
combined effect may be defined as a mathematical function of time 
when time is numerically described. It is infinitely easier to give the 
passage of time a quantitative measurement than it is to measure all the 
phases of the weather, the population, etc., which contribute to those 
variations in series customarily called secular trend and seasonal varia- 
tion. Secular trend is usually taken as a function of a series which in- 
creases one unit increment in magnitude with the passage of each year. 
Seasonal variation is a function of an annually recurring series of values 
increasing during the year from one to twelve. 

The assumption that the effect of factors which vary in time may be 
expressed as functions of numerically designated time is true only to the 
extent that such factors actually do vary in magnitude proportional to 
the associated numerical descriptions assigned to time. This is seldom 
the case. It is therefore advisable to use those series which actually 
measure the influencing factors themselves, whenever they are at 
hand, rather than to represent them as functions of time. That at 
least one such series is at hand—and frequently an important one—is 
implied when one time series is correlated with another. Should these 
two, dependent and independent, series chance to have approximately 
similar trend or seasonal movements, and should these latter be ex- 
tracted from the two series prior to their correlation, one might under 
the name of seasonal and trend extract much of the variation by which 
these two were related, and thus obscure their true relationship. The 
unconsidered practice of eliminating trend and seasonal from series 
prior to their correlation is to be looked upon askance, therefore. It is 
often a serious error. Two examples serve to illustrate. 

Suppose that the height and weight of a single individual were meas- 
ured annually between the ages of one and sixteen, and that the result- 
ing series of weights and heights be correlated. A high degree of cor- 
relation would be shown from which by the customary process of 
deduction, of which correlation is but the tool, it would truthfully be 
maintained that height and weight had something to do with each 
other. But if the secular trends of the two series were eliminated and 
residual variations correlated, the correlation coefficient would be 
nearly zero, from which the absurd conclusion that weight and height 
had nothing to do with each other might with apparently equal logic be 
drawn. Such faulty methods tend to discredit the use of correlation 
principles in analysis of time series. 
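
The height and weight example can be imitated with made-up figures. In the sketch below the growth rates, the size of the year-to-year fluctuations and the assumption that those fluctuations are unrelated between the two series are all hypothetical; the trend is assumed to be a straight line fitted by least squares.

```python
import math
import random

def pearson(x, y):
    """Product-moment correlation coefficient of two equal-length series."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def remove_linear_trend(series, time):
    """Residuals of the series about a least-squares straight line in time."""
    n = len(time)
    mean_t, mean_s = sum(time) / n, sum(series) / n
    slope = (sum((t - mean_t) * (s - mean_s) for t, s in zip(time, series))
             / sum((t - mean_t) ** 2 for t in time))
    return [s - (mean_s + slope * (t - mean_t)) for t, s in zip(time, series)]

random.seed(1925)  # fixed seed so the sketch is repeatable
ages = list(range(1, 17))
# Hypothetical growth curves: steady growth plus small unrelated fluctuations.
height_cm = [75 + 6.0 * age + random.gauss(0, 1.5) for age in ages]
weight_kg = [10 + 3.0 * age + random.gauss(0, 1.5) for age in ages]

r_gross = pearson(height_cm, weight_kg)
r_residual = pearson(remove_linear_trend(height_cm, ages),
                     remove_linear_trend(weight_kg, ages))
print(f"gross r = {r_gross:.3f}, r after removing trends = {r_residual:.3f}")
```

Under these assumptions the gross coefficient is close to unity, because almost all the variation in both series is growth, while the coefficient between the residuals reflects only the small fluctuations and will ordinarily be far lower and of no settled sign.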

Consider a second example: Suppose the relation of supply of eggs 
to the price of eggs, prior to the development of cold storage facilities, 
were being studied. The pronounced seasonal production would in- 
duce a corresponding seasonal fluctuation in price. This seasonal 
fluctuation in price is very obviously not attributable to a number of 
vague, “otherwise unmeasured” factors, outside of the system we are 
analyzing, as the elimination of seasonal variation would imply; but 
rather it is directly attributable to the fluctuation in the production. 
To extract the seasonal variation from these series before correlating 
them is to obscure greatly the true causal relation subsisting between 
them. Only that portion of the price movement inexplicable in terms 
of the relation to the known factor—production—is properly attrib- 
utable to trend, seasonal and other factors. The error of extracting 
seasonal variation in this case is so obvious that probably but few in- 
vestigators would fall into it. Nevertheless it illustrates many cases of 
identical, though subtler, nature wherein the error has been made. A 
somewhat ironical aspect of extracting trend and seasonal in the search 
for relationships is that those phenomena which are fundamentally re- 
lated are apt to have either similar or inverse trend and seasonal fluc- 
tuations; the elimination of these, prior to their correlation, therefore, 
tends to obscure the very relationship which is being sought. 

The escape from this predicament involves no new theoretical con- 
siderations but merely the application to time series of recognized cor- 
relation theory with strict reference to the implicit assumptions cited. 
All that is necessary is to remember that fundamentally a numerical 
description of passage of time is merely taken to represent the magni- 
tude of the combined effect of otherwise unmeasured factors and then 
this series of “magnitudes” is treated precisely as any other independ- 
ent factor. If we are concerned solely with the effect of these factors 
as so measured on the dependent series, then correlate the two time 
measurements (secular and seasonal) with the dependent, and the re- 
gression curves of the dependent series on the time measurements will 
yield the net regression curves, or curves showing the net effect of trend 
and seasonal. If we are comparing the dependent with several inde- 
pendents, however, the domain of the “otherwise unmeasured” factors 
is diminished by the removal of these independents therefrom, and the 
effect, therefore, of the remaining “otherwise unmeasured” factors is 
not necessarily the same. The proper modification will be secured if 
we proceed in accordance with usual multiple correlation method which 
apportions to each independent variable, including the “otherwise un- 
measured,” its net importance in determining the variability in the 
dependent. Stated as a rule the process is: Consider the trend and 
seasonal designations as two, causal, independent factors in accordance 
with the basic assumption implicit in any use of them and solve for 
their net effect on the dependent simultaneously with any other inde- 
pendent factors involved by the methods of multiple correlation. 
Since the curve representing seasonal variation must necessarily return 
to its starting level, ordinary linear multiple correlation is inadequate. 
Curvilinear correlation methods as recently developed, however,¹ give 
satisfactory results. 

In following this practice, as often occurs, proper methods go 
hand in hand with better results. On theoretical considerations, corre- 
lation coefficients secured by simultaneous, or multiple, correlation 
methods will be as high or higher, and never less, than those resulting 
from any possible sequence of consecutive elimination of the influence of 
independent factors from the dependent, of which current methods of 
eliminating seasonal variations before correlating are an example. In 
actual trials of the two methods the writer has found that the simul- 
taneous solution for trend and seasonal regression curves and curves 
for other factors always gave markedly higher correlations. Further- 
more, there is considerably less labor involved in the simultaneous 
method, for the abstraction of trend and seasonal from each series be- 
fore they are correlated is eliminated. 

It might be noted in closing that the correlation of two series, x and 
y, from which secular trend and seasonal variation have been removed, 
results not in a coefficient of gross correlation, as often is conceived, 
but in something closely akin to the coefficient of partial or net corre-
lation, $r_{xy.ts}$, wherein the subscripts $t$ and $s$ designate trend and seasonal
respectively. This coefficient, if desired, may be easily secured from
the recommended coefficient of multiple correlation, $R_{x.yts}$, by a simple
computation. The net coefficient, often mistaken for a measure of the
gross relationship subsisting between the two series, should be inter-
preted with a full realization of the usual implications of net coefficients.
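
The kinship between the correlation of "corrected" series and the net coefficient can be checked numerically. The following is a minimal sketch in Python, assuming for illustration only that trend and seasonal are each represented by a single numeric designation, t and s, entering linearly (the text itself recommends curvilinear methods for the seasonal). The function names and the sample series are hypothetical. It removes t and s from x and y by least squares, correlates the residuals, and compares the result with the second-order partial coefficient built up from the simple correlations.

```python
import math
import random

def pearson(u, v):
    """Simple (gross) correlation coefficient of two equal-length series."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    suv = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    suu = sum((a - mu) ** 2 for a in u)
    svv = sum((b - mv) ** 2 for b in v)
    return suv / math.sqrt(suu * svv)

def partial(r_ab, r_ac, r_bc):
    """Partial correlation of a and b with one further variable c held constant."""
    return (r_ab - r_ac * r_bc) / math.sqrt((1 - r_ac ** 2) * (1 - r_bc ** 2))

def remove_trend_and_seasonal(z, t, s):
    """Residuals of z after a least-squares fit on t and s (with a constant)."""
    n = len(z)
    mz, mt, ms = sum(z) / n, sum(t) / n, sum(s) / n
    zc = [a - mz for a in z]
    tc = [a - mt for a in t]
    sc = [a - ms for a in s]
    stt = sum(a * a for a in tc)
    sss = sum(a * a for a in sc)
    sts = sum(a * b for a, b in zip(tc, sc))
    stz = sum(a * b for a, b in zip(tc, zc))
    ssz = sum(a * b for a, b in zip(sc, zc))
    det = stt * sss - sts ** 2          # determinant of the 2x2 normal equations
    b_t = (stz * sss - ssz * sts) / det
    b_s = (ssz * stt - stz * sts) / det
    return [a - b_t * b - b_s * c for a, b, c in zip(zc, tc, sc)]

# Hypothetical monthly data: four years, one trend and one seasonal designation.
rng = random.Random(0)
t = [float(m) for m in range(48)]
s = [math.sin(2 * math.pi * m / 12) for m in range(48)]
x = [0.5 * a + 3.0 * b + rng.gauss(0, 1) for a, b in zip(t, s)]
y = [0.2 * a - 2.0 * b + 0.8 * c + rng.gauss(0, 1) for a, b, c in zip(t, s, x)]

# Route 1: remove trend and seasonal from each series, then correlate.
r_residuals = pearson(remove_trend_and_seasonal(x, t, s),
                      remove_trend_and_seasonal(y, t, s))

# Route 2: second-order partial coefficient from the simple correlations.
r_xy_t = partial(pearson(x, y), pearson(x, t), pearson(y, t))
r_xs_t = partial(pearson(x, s), pearson(x, t), pearson(s, t))
r_ys_t = partial(pearson(y, s), pearson(y, t), pearson(s, t))
r_xy_ts = partial(r_xy_t, r_xs_t, r_ys_t)

print(round(r_residuals, 6), round(r_xy_ts, 6))
```

Under these linear assumptions the two routes agree to rounding error. Where trend and seasonal are abstracted by other devices, the agreement would be only approximate, which is consistent with the phrase "closely akin" in the text.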


1 “A Method of Handling Curvilinear Correlation for Any Number of Variables,” by Mordecai
Ezekiel, this JOURNAL, December, 1924.


THE APPLICATION OF STATISTICS AS AN AID IN
MAINTAINING QUALITY OF A MANUFACTURED 
PRODUCT 


By W. A. SHEWHART, Bell Telephone Laboratories, Inc.


The object of this note is to emphasize what appears to be a com- 
paratively new field of application of statistical methods. 

As is evident, the quality of manufactured product is determined by 
those characteristics, either qualitatively or quantitatively measurable, 
which serve to distinguish it from all others. A uniform product is, 
of course, desirable. For the purpose of insuring the desired uni- 
formity of product, the manufacturer customarily defines what he 
considers to be the necessary steps in the manufacturing process 
including the selection of raw material and the inspection of the 
final product. However, he realizes that, irrespective of the care taken 
in defining and controlling the processes, the units of the product will 
differ among themselves in respect to any one given characteristic. 
He realizes that a product in which all of the units are identical cannot 
be obtained, and, in most instances, can cite possible typical causes for 
the existence of differences between units. 

For example, random fluctuations in such factors as humidity, 
temperature, grade of raw material, and wear and tear on machines 
may introduce resultant differences between units in respect to any one 
characteristic. Similarly, physiological and psychological conditions 
of the personnel involved in the manufacturing, assembling, testing, 
and inspecting operations may furnish other causes which produce 
differences between the units. 

Thus, granting that it is impossible to eliminate all such causes, just 
as it is impossible to eliminate all causes of error in making a physical 
measurement, the manufacturer tries to eliminate those causes which 
produce irregular, cyclic, or secular trends in any one characteristic 
of the product just as in the physical laboratory the experimenter 
tries to eliminate such causes when making physical measurements. 
The resultant system of causes, after the elimination of those which 
vary with time, can be thought of as constant in the sense that its 
effects are distributed about some average independent of time. 
Under such a system, one month’s product will not be identical with 
another, but the differences from month to month will fluctuate at 
random about some average. 

Even though the manufacturer is successful in establishing processes 
which will give him initially a uniform product, nevertheless he is 
seldom successful in setting up a manufacturing procedure absolutely
fool-proof in the sense that it precludes all possibilities of causes enter-
ing the manufacturing process which produce trends in the product. 
Hence he is always on the alert for detecting the existence of the effects 
of such causes when studying the results of inspection. 

Ways in which cyclic, erratic, and secular trends may be superposed 
upon the effects of the constant system of causes scarcely need be 
mentioned. For example, seasonal changes in temperature and 
humidity may produce cyclic fluctuations in the quality of the raw 
material or in some one or more of the steps involved in the manufactur- 
ing process. Possibly these same seasonal changes in temperature 
and humidity may affect the product indirectly through their direct 
effects upon the personnel engaged in production and inspection. 
Erratic fluctuations in the quality of a product may accompany an 
erratic demand for the manufactured article, because such a demand 
usually necessitates modification of the production program such as 
adding to or decreasing the personnel, or otherwise modifying certain 
steps in the customary procedure. Similarly, secular or long time 
trends in the quality may accompany gradual changes in the character-
istics of the raw material, improved methods for meeting specifica-
tions, or gradual changes in the nature of the personnel such as might be
necessitated by outside changes in the labor market. All of these 
causes add their effects to those of the constant system, and hence 
may not be easily detected at the time of their initial occurrence. 

By detecting the existence of trends, statistics plays one important 
role in helping the manufacturer to maintain the quality of product. 
In other words, statistical methods help in estimating the significance 
of observed differences between the units of one month’s product or 
of differences in the average quality from month to month. For the 
ideal case, the system of manufacturing causes, defined as indicated 
above, should yield a uniform product in the sense that, for any one 
characteristic of the units, the probability $y$ of producing a unit within
a definite range $X \pm \delta X$ of this characteristic can be defined by an
equation involving the customary statistical parameters: i.e., the
average $\bar{X}$, the root mean square or standard deviation $\sigma$, and the two
Pearsonian constants $\beta_1$ and $\beta_2$, in the form


$$y = f(X, \bar{X}, \sigma, \beta_1, \beta_2).$$


A constant system of causes always exists after causes which are a 
function of time have been eliminated and such a system obviously 
produces a uniform product in the sense that for any one specific 
characteristic the number of units in a given period having a given 
magnitude will bear an approximately constant ratio to the total
number of units manufactured in that period. The manufacturer is
therefore concerned with those statistical methods which will help 
him in deciding how much the product of one period may be expected 
to differ from that of another similar period without indicating a 
trend. 

If the graduation of the observed data by means of some one or other 
of the well known closed or open type probability functions is satis- 
factory in the light of some test for goodness of fit such as the Pear-
sonian $\chi^2$ test, it may be assumed that the observed sample comes
from a constant system of causes. If such a satisfactory graduation 
cannot be found, this fact may be taken as evidence of the existence of 
a trend the causes of which must be found and eliminated before a 
uniform product can be produced. 
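
The goodness-of-fit test referred to above may be sketched as follows. This is a minimal illustration in Python and not the author's own computation: the cell frequencies are hypothetical, the function names are introduced here, and the degrees of freedom follow the modern convention of deducting one for the total and one for each parameter fitted from the data.

```python
def chi_square(observed, expected):
    """Pearson's chi-square: sum of (observed - expected)^2 / expected over the cells."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same number of cells")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

def degrees_of_freedom(n_cells, n_fitted_parameters):
    """Cells, less one for the fixed total, less one per parameter fitted from the data."""
    return n_cells - 1 - n_fitted_parameters

# Hypothetical month of inspection results grouped into five cells, compared
# with the frequencies given by a graduating curve having two fitted parameters.
observed = [6, 24, 41, 22, 7]
expected = [7.0, 24.0, 38.0, 24.0, 7.0]

statistic = chi_square(observed, expected)
dof = degrees_of_freedom(len(observed), n_fitted_parameters=2)
print(statistic, dof)
```

The statistic is then compared with a tabulated chi-square value for the stated degrees of freedom. A value well within the tabulated limit is consistent with a constant system of causes, while a large value points to the possible existence of a trend.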

After a satisfactory graduation has been found, it is possible to 
establish for each of the four parameters limits of variation which may 
be attributed to sampling. Variations in the observed values of one or 
more of these four parameters for a given month’s product outside 
their respective limits may be taken as indicating the possible existence 
of causes of trend. When deviations outside the limits are noted, 
special effort can then be made to ascertain the causes thereof. How- 
ever, so long as the observed parameters fall within their respective 
limits, the assumption is reasonable that the product is being manu- 
factured under a constant system of causes, or in other words that the 
product is uniform. In this way significant deviations in the quality 
of product can be detected and investigated whereas no time is lost on 
studying the causes of non-significant deviations in the quality of 
product. 
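
A rough sketch of such limits is given below in Python. The note does not state how wide the limits should be, so the multiplier of three standard errors is an assumption made here, as are the function names and the numerical values. Only the limits for the average and the standard deviation are shown, using the usual large-sample standard errors $\sigma/\sqrt{n}$ and $\sigma/\sqrt{2n}$ (the latter holding approximately for a normal universe).

```python
import math

def four_parameters(xs):
    """Average, standard deviation, and the Pearsonian beta_1 and beta_2 of a sample."""
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n   # second moment about the mean
    m3 = sum((x - mean) ** 3 for x in xs) / n   # third moment about the mean
    m4 = sum((x - mean) ** 4 for x in xs) / n   # fourth moment about the mean
    sigma = math.sqrt(m2)
    beta1 = m3 ** 2 / m2 ** 3
    beta2 = m4 / m2 ** 2
    return mean, sigma, beta1, beta2

def sampling_limits(mean, sigma, n, k=3.0):
    """Limits for the average and standard deviation of a sample of n units.

    k = 3 is an assumed multiplier, not a figure taken from the text.
    """
    se_mean = sigma / math.sqrt(n)
    se_sigma = sigma / math.sqrt(2 * n)
    return {
        "mean": (mean - k * se_mean, mean + k * se_mean),
        "sigma": (sigma - k * se_sigma, sigma + k * se_sigma),
    }

def is_outside(value, limits):
    """True when an observed parameter falls outside its sampling limits."""
    low, high = limits
    return not (low <= value <= high)

# Hypothetical standard established from past product, and a month's sample of 100 units.
limits = sampling_limits(mean=10.0, sigma=2.0, n=100)
flagged = is_outside(10.7, limits["mean"])   # observed monthly average of 10.7
print(limits, flagged)
```

An observed average of 10.7 lies outside the limits for the average and would prompt a search for causes, whereas one of 10.3 would not.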

To aid the manufacturer in detecting the existence of trends is not the 
only role played by statistical methods in maintaining the quality of
product. Instead, they go further by furnishing clues as to the possible 
types of causes thereof. A knowledge that certain ones of these four 
parameters lie outside their respective limits suggests certain possible 
types of causes which might have produced this condition. With this 
information at hand, the manufacturer may draw upon his experience 
to suggest possible causes, of the type suggested by the analysis of the 
data, which might have produced the observed trend. Thus by fur- 
nishing clues which restrict the field of investigation to be covered in the 
search for the causes of trend, statistical methods effect an additional


THE QUANTITATIVE DETERMINATION OF
FUNDAMENTAL CHANGES IN ECONOMIC DATA 


By O. Gressens, University of Illinois 


Alfred Marshall’s opinion that qualitative analysis had done the bulk 
of its work for the present, and that quantitative analysis must needs 
catch up, indicates the field of inquiry in economics which is in need of 
the most rigorous study. In spite of the scholarly work which has 
more recently been done in this field especially by Professor Persons, 
the quantitative analysis of economics still lacks the finished elegance 
which has been attained, many controversies notwithstanding, in the 
theory of economics. It is only comparatively recently that the 
analytical possibilities of statistics have been applied with discipline to 
economic data, and the theories and methods underlying this analysis 
are still in a comparative state of flux. 

The influences which determine the magnitude of economic data are 
numerous and of a heterogeneous character, and therefore often difficult 
to isolate. A rigorous analysis nevertheless demands the isolation of 
the effects of these influences, for little or nothing can be said about the 
quantitative result of any condition when it cannot be separately 
examined. A considerable differentiation must therefore be a formal 
part of the analysis of time series. This differentiation, however, does 
not necessarily have to be detailed to the extent of separating out every 
fluctuation of a series. Little of value would be gained by such minute 
analysis. The regular, fundamental changes which are manifest must 
nevertheless be analyzed out of the series in order that their effects may 
be quantitatively determined. 

The continuous exercise of acquisitive pursuits results in certain 
regular, cyclically recurring or uniformly advancing events, whose 
consistent characteristics are the more pronounced directly as the area 
over which they are contemplated is widened. Economic theory is 
fundamentally based on this same consideration. The theory of 
value, the law of supply and demand, the law of diminishing returns, for 
instance, are based on this concept. The quantitative analysis of eco- 
nomic data also proceeds in accordance with this same principle. 
These data, which form the most important division of the historic or 
time series, present the tangible results of economic activity, of the 
production and distribution, acquisition and consumption of economic 
goods. In their analyses the fundamental problem is to refine them by 
separating out those variables which are regularly recurrent in their 
influence and which therefore exert a continuous influence on the 
magnitude of the data. For these purposes there has been an in-
creasing tendency to make use of the rigorous methods of analytical
mathematics. This has lent to the quantitative analysis of economics 
increased precision and accuracy, as it has increased the elegance and 
discipline of the methods. 

The magnitude and the behavior of economic data depend upon the 
concerted effect of economic conditions prevailing at any one time. 
The more inclusive the series under consideration (as, for instance, a 
general index of prices or of production), the greater is the regularity 
of the influences which shape its behavior. In other words the more 
adequate the sample which has been chosen for analysis, the more 
applicable are the laws of large numbers. This general principle can 
most effectively be observed when attention is turned to individual 
firms. The larger the firm, the wider its markets, the steadier and
more diffused the demand for its products, the more regular are its
reactions. Obversely the smaller the enterprise, the more localized its 
markets, the more irregular and unpredictable are the effects of eco- 
nomic conditions and, by implication, the less satisfactory becomes 
a rigorous analysis. 

The quantitative analysis of economics turns about the central 
problem of the determination of the trend. The element of secular 
trend is, with few important exceptions outside of price series, an ele- 
ment of growth. Upon reflection the necessity of this condition be- 
comes clear. If no upward trend were present the only other two 
possibilities would be no trend, indicating a static condition with re-
spect to growth, or a downward trend, indicating actual regression.
Neither of the latter two cases is characteristic of economic
society except in a few isolated incidents where an industry either has
been partially superseded or is actually dying out.

The determination of the secular trend itself is a thorny problem, and 
the most controversial question in the entire analysis of economic 
data. Its calculation is essentially a problem of curve fitting. The 
best fitting curve (or curves) to a series in general is the curve (or 
curves) which most satisfactorily expresses the fundamental law (or 
laws) of behavior of that series. In every case such a curve can only 
be fitted with a certain degree of approximation so that the trend 
represents then the best approximation to the fundamental movement 
of the series. 

This problem of curve fitting besides presenting difficulties on its 
mathematical side and labor on its mechanical side presents also 
difficult and controversial questions on its economic side. The de- 
sired trend must not simply fit the original data so as to present a 
smoothed curve roughly determined by the loci of the points of
observations, but it must fit these data in such a manner that it expresses
only the fundamental or organic changes which take place over a period 
of time. Concretely, a fitted curve which would pass smoothly through 
the ragged curve of the original observations, but which would contain 
in its own behavior reflections of not only the major but also the minor 
fluctuations of the original series would not satisfy the economic def- 
inition of a secular trend. This definition predicates that the trend 
curve be a base from which these variations can be measured as from 
the ‘‘normal” position. The irregularly recurring variations, due 
either to sporadic contingencies or to the accumulating maladjustments 
in economic society, which cause the data alternately to rise above and 
then fall below the fundamental rate of growth which would normally 
be expected to obtain, must therefore not be considered in the curve of 
the trend. In fact, it is doubtful if very often a curve higher than the 
second degree of the potential series ($y = a + bx + cx^2$) will be found
necessary in fitting a trend. 
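
A second-degree trend of this kind can be fitted by least squares as in the sketch below. It is written in Python under assumptions made here: the observations are equally spaced, time is measured from the middle of the period, and the annual figures are hypothetical. The function names are likewise introduced for illustration.

```python
def centered_time(n):
    """Time index measured from the middle of the period, e.g. -2, -1, 0, 1, 2."""
    return [i - (n - 1) / 2 for i in range(n)]

def fit_second_degree_trend(y):
    """Least-squares coefficients (a, b, c) of y = a + b*x + c*x**2."""
    n = len(y)
    if n < 3:
        raise ValueError("at least three observations are needed")
    x = centered_time(n)
    sx = [sum(xi ** p for xi in x) for p in range(5)]                      # sums of x^p
    sxy = [sum(xi ** p * yi for xi, yi in zip(x, y)) for p in range(3)]    # sums of x^p * y
    # Augmented matrix of the three normal equations.
    m = [[sx[r + c] for c in range(3)] + [sxy[r]] for r in range(3)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(3):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * w for v, w in zip(m[r], m[col])]
    a, b, c = (m[i][3] / m[i][i] for i in range(3))
    return a, b, c

def trend_values(y):
    """Ordinates of the fitted trend at each observation."""
    a, b, c = fit_second_degree_trend(y)
    return [a + b * xi + c * xi ** 2 for xi in centered_time(len(y))]

# Hypothetical annual series.
data = [102.0, 108.0, 117.0, 121.0, 133.0, 140.0, 155.0]
trend = trend_values(data)
deviations = [obs - fit for obs, fit in zip(data, trend)]
print(fit_second_degree_trend(data))
```

The list `deviations` holds the departures of the original items from the fitted trend, which are the quantities from which cyclical and other variations would subsequently be measured.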

There is an important characteristic of a rigorously fitted trend which 
deserves some consideration. Over the entire period over which the 
trend is fitted, all those factors which influence the magnitude of the 
data through a period of years, which, in other words, determine the 
underlying tendency, are taken up by the secular trend. Thus the 
variations of the cycles above and below the calculated normal have a 
net result of zero over the period over which the trend is fitted.¹ In case
the secular trend is calculated by a method which does not present 
a rigorous treatment, the fluctuations from the trend in turn are the 
result of its chance placement. There is no assurance, in other words, 
that all the elements which contribute to the actual expansion of the 
series have been taken up by the trend, and are expressed by it. When 
a curve is rigorously fitted, on the other hand, this result is assured to 
a high degree of accuracy. The fundamental growth of the series in 
hand is thus expressed within the limits of each particular problem. 
Similar results, of course, may substantially be attained by much less 
rigorous analysis, but the degree of the resulting approximation depends 
upon the judgment and experience of the manipulator, and must neces- 
sarily differ from individual to individual. 
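
Why the deviations from a rigorously fitted trend net to zero can be seen from the least-squares conditions. The short derivation below assumes that "rigorously fitted" means a least-squares fit of a curve containing a constant term $a$, with observations $y_i$ at times $x_i$, $i = 1, \dots, n$, and fitted values $\hat{y}_i$; this notation is introduced here for illustration and is not the author's.

```latex
\begin{aligned}
S(a,b,c) &= \sum_{i=1}^{n} \bigl(y_i - a - b x_i - c x_i^2\bigr)^2, \\
\frac{\partial S}{\partial a} &= -2 \sum_{i=1}^{n} \bigl(y_i - a - b x_i - c x_i^2\bigr) = 0
\quad\Longrightarrow\quad
\sum_{i=1}^{n} \bigl(y_i - \hat{y}_i\bigr) = 0 .
\end{aligned}
```

The same condition arises in the method of moments from equating the zeroth moments of the data and of the curve. A trend drawn freehand or by moving averages satisfies no such condition except by chance.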

In fitting a trend to a series of data, the problem immediately arises 
as to the time over which the trend should be calculated, and after it 
has been fitted the further problem presents itself as to the exact mean- 
ing and conclusiveness of the trend. In general, the secular trend is 
a relative measurement. Only over a long period of time when there is


1 Cycle movements caused by abnormal conditions, such as the Great War, would, of course,
constitute exceptions to this general condition.

no question but that the fundamental movement of the data has
changed within the period under consideration, does the trend express 
absolute changes. In all other cases the trend element is directly 
dependent upon the period which is being considered. In other words, 
the secular trend is in all these latter cases an expression of the inclu-
siveness in point of time of the data which are being analyzed. There can
be no assurance, therefore, of the absoluteness of the calculated trend 
unless the analysis is extended over a period sufficiently long to exclude 
from the trend all short term influences. As economic conditions 
change, so do the trends of economic data change, and as there is no 
single criterion by which economic conditions can be interpreted, so 
there can be no single trend in any series which expresses the funda- 
mental movement of the data. The reliability of an unqualified state- 
ment of the trend is therefore a direct function of the time included in 
the analysis. And as the long time effects of economic conditions can 
only be accurately discerned in retrospect and over a long period, so the 
absolute secular trends of a series of data can only be calculated from 
observations covering a long span of time. Close approximations can, 
of course, be made from much shorter periods. 

The point which was raised by Dr. Ingraham¹ that the quantitative
statement of growth by a least squares or moments curve is objection- 
able, because when the period of the calculated curve is changed slightly, 
the results of the added increments from, say month to month, some- 
times vary greatly, is in need of some discussion. That such inconsis- 
tent results are obtained by the method of testing as used by Dr. 
Ingraham, is true. This, though it does constitute a distinct warning 
against a slavish and indiscriminate use of methods, does nevertheless 
form no indictment against a rigorous refinement of economic data or 
time series. 

In the first place the secular trend of time series is defined as the long 
time movement of the data. It corresponds to the long time view of 
the economist. It is useless, therefore, to argue from a short term fitted 
trend, for such a trend must necessarily only be an approximation of 
the true trend. Dr. Lincoln Hall has developed a method distinctly 
designed for the current approximation of the trend by a series of mov-
ing integrals.² But such trend calculations must necessarily give way
to long term corrections when sufficient time has elapsed to include
these periods in a long term. The movement of the trend does change,
however, from time to time due to fundamental changes in economic 
conditions. In other words, the laws of growth change if not their 





1 “The Refinement of Time Series,” this JOURNAL, June, 1925.
2 “A Moving Secular Trend and Moving Integration,” this JOURNAL, March, 1925.

direction then at least their rapidity. Whenever such a change does
occur in the data, a new trend period must be determined, and a new 
trend fitted to this period. 

In the second place, when tests are made on the results of any 
quantitative trend measurement, the periods over which the trends are 
calculated cannot be indiscriminately chosen, nor the method be 
judged on the basis of these random results. In the case of the method 
of least squares or of moments, the calculated trend will be affected by 
the nature of the end fractions of the period due to the weighting char-
acteristic of these methods.¹ If the period through which a trend is
fitted begins in a distinct slump and ends in a boom, obviously there is
an upward bias introduced into the curve of trend. The obverse of
this is, of course, equally true. The period therefore, through which 
the trend curve is fitted must begin and end in the same cycle phase. 
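
The bias described above can be illustrated with an artificial series. The sketch below, in Python, assumes a straight-line growth of one unit per month with a perfectly regular four-year cycle superposed, which is of course far more regular than any actual business cycle. All names and figures are hypothetical.

```python
import math

def ls_slope(t, y):
    """Slope of the least-squares straight line through the points (t, y)."""
    n = len(t)
    mt, my = sum(t) / n, sum(y) / n
    num = sum((a - mt) * (b - my) for a, b in zip(t, y))
    den = sum((a - mt) ** 2 for a in t)
    return num / den

def series(month, growth=1.0, amplitude=10.0, period=48):
    """Hypothetical series: straight-line growth plus a regular cycle."""
    return 100.0 + growth * month + amplitude * math.sin(2 * math.pi * month / period)

# With a 48-month cycle, months 36 and 84 are troughs and month 60 is a peak.
slump_to_boom = list(range(36, 61))      # begins in a slump, ends in a boom
trough_to_trough = list(range(36, 85))   # begins and ends in the same phase

slope_slump_to_boom = ls_slope(slump_to_boom, [series(m) for m in slump_to_boom])
slope_same_phase = ls_slope(trough_to_trough, [series(m) for m in trough_to_trough])

print(slope_slump_to_boom, slope_same_phase)
```

The period running from slump to boom yields a slope well above the true growth of one unit per month, whereas the period beginning and ending in the same phase recovers the true growth. With actual data, where cycles vary in length and amplitude, the correction would be only approximate.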

Another error introduced into the fitted curve is caused by the 
varying amplitude of the cycle fluctuations. The wider the swings 
away from the usual course, the greater will be the influence of this 
particular period upon the fitted curve. However, the business cycle 
leaves its imprint, in greater or less degree, on subsequent years. 
Economic conditions very seldom, if ever, return to the basis which ob-
tained before the cycle set in. The error introduced into the calculated
trend by the varying amplitude of the cycle swings is, therefore, sub- 
dued by the fact that the trend, by imbibing some of the effects of the 
cycles, expresses to this extent the influence of these periods upon the 
organic constitution of economic conditions. 

In the cases of many time series, the production and distribution of 
products which they represent is affected by the adjustment of con- 
sumption in accordance with the changing seasons. This is obviously 
not the case in series where consumers’ demand is relatively constant 
throughout the year, irrespective of the changing seasons. In cases 
where this effect does enter, however, an additional problem in quanti- 
tative measurement is introduced. This factor of seasonal variation 
has caused some discussion as to its “normal” influence. The true
seasonal is perhaps impossible of exact measurement, as Professor
Persons has pointed out.² The problem of the measurement of this
factor resolves itself therefore into a determination of the normal
seasonal, and the measurement of this “normal.”

The “normal” seasonal is a changing factor, just as all the economic 
conditions, which through their interactions determine the behavior of 
economic data, are continuously subject to variation.³ Shifts in


1 “The Least Squares Criterion of Trend Lines,” by W. L. Crum, this JOURNAL, March, 1925.
2 Review of Economic Statistics, Vol. 1, 1919.
3 See author’s article in this JOURNAL, June, 1925.

consumers’ demands, which are always taking place, are as continually
changing the nature of seasonal variation. In measuring the influence 
of this factor, therefore, the concept of a normal recurring seasonal, un- 
changing in its nature and measurable with mathematical precision is 
untenable. Here again the measurement must allow for variations. 
The case of the normal seasonal calculable by a single set of indices is 
only the one limit, while the increasingly irregular variations obscuring 
any regularity of movement is the other limit between which the 
changing seasonal must be calculated. As Professor Fisher has 
pointed out,¹ the element of seasonal variation ought properly be called
a cyclical variation on account of the comparative regularity of its oc- 
currence. Besides this regularity, slowly changing amplitudes of 
fluctuation are also a distinguishing characteristic of the seasonal. 
This is in contrast to the irregularly varying amplitudes of cycle fluc- 
tuations. The fundamental characteristic of the seasonal is not only 
its comparative regularity but also the gradual changes which take 
place in it over a period of time. Thus the organic alterations which 
occur in economic data are shown not only in the secular trend, which 
expresses the change in growth of economic data, but also in the sea- 
sonal which describes the distribution of activity within the year. 
These changes are the effects of varying economic conditions. How- 
ever, economic conditions do not grow or change with the comparative 
smoothness and regularity indicated by these two factors. There are 
recurrent periods of transitional changes, of temporary maladjustments 
of supply and demand, monetary disturbances, the effects of which 
have come to be called by the name of the business cycle. 

The movements due to this group of influences are distinguishable 
from the secular trend and the seasonal variation in a fundamental 
respect. They are not characterized by the regularity of either of the 
other two factors. The term, cycle, is in fact a misnomer.² The busi-
ness cycle is composed of a series of periods of maladjustments, due to
the cumulative effect of the errors of judging effective demand on the 
part of producers and the cumulative effect of the errors of consumers 
in judging the market supply. 

The cycle is thus not a permanent condition but a transitional period 
of readjustments. Whatever influence the cycle has, which is of any 
permanence, becomes immediately a property of the secular trend. 
This is a necessary desideratum for a clear and definite concept of the 
cycle as well as of the trend. The secular trend has been defined as a 


1 “Our Unstable Dollar and the So-Called Business Cycle,” by Irving Fisher, this JOURNAL, June,
1925.

2 For a criticism of this connotation see Prof. Fisher's brilliant article in the June, 1925, issue of this
JOURNAL.

fitted curve which expresses the fundamental law of growth of the data 
from which it has been calculated, without those fluctuations which 
are wholly sporadic, and those less sporadic but, nevertheless, irregularly 
recurrent, on the one hand; and those other fluctuations which occur 
with true cyclical regularity and which express the characteristic dis- 
tribution of the data within the given twelve-month periods, on the 
other hand. The cycle fluctuations, in fact, weave through the spo- 
radic fluctuations and the seasonal cycles, and oscillate about the trend 
in longer, though irregular, wave motions. The net effects of these 
cyclical variations over a period of time are zero (or approximately 
zero), and this is in most cases the true criterion for the test of the 
adequacy of the fitted curve. The secular trend expresses all the ef- 
fects of those fundamental, organic changes, which have made for the 
growth or the decline of the data in hand. None of the other in-
fluences which cause fluctuations in the data can have a part in de-
termining the growth. The element of seasonal variation also ex-
presses a fundamental change. But this change is not one of growth
or decline but merely one of characteristic distribution within any given 
year. Changes in the seasonal, therefore, express the effects of the 
shifting demands of consumers with regard to the seasons. A de- 
crease in the seasonal is the result of the greater steadiness of con- 
sumers’ demands throughout the year, while an increase in the seasonal 
indicates the adjustment of consumers’ demands with regard to the 
changing seasons. Any element of change, however, which results in 
an acceleration or a retardation of the rate of growth in the data, is 
taken up by the secular trend, and at these points where such funda- 
mental changes in the rate of growth do take place the old law of 
growth no longer obtains. It is these points, therefore, which mark 
the breaks in the trend, and which are accountable for the fact, often 
disconcerting in the application of methods of quantitative analysis, 
that it is necessary to fit several trends over a period of years to a given 
set of data. 

Dr. Olin Ingraham’s objections to the more rigorous methods 
(method of least squares, or of moments) of fitting these trends are, as 
was noted above, objections which must be referred to the statistical 
technique of curve fitting, for the periods of trend fitting cannot be in- 
discriminately chosen, as he has done in his illustrations, but must be 
chosen congruously with the periods where actual changes in the rate 
of growth take place. Such errors as still persist perhaps cannot be 
eliminated. But this does not vitiate the analysis entirely, as the 
greatest possible accuracy in measurement is far superior to entire con- 
jecture, and even in physical sciences, the measurements are only the 
best possible approximations.


RICHTER’S REVIEW OF “ELEMENTS OF BUSINESS STATISTICS”


To the Editor of the Journal of the American Statistical Association: 
Sir: 

I am moved to write you with reference to several points raised by Mr. F. E. 
Richter in his review of my book entitled Elements of Business Statistics, in the 
June issue of this JOURNAL.

In the first paragraph of his review Mr. Richter states that “The fact that
illustrations are almost entirely from the field of business does not entitle him
(the author) to call the book” by that name. This title is, according to the
reviewer, “misleading” because “there is mighty little philosophy (of statistics
in business) in the book.” This is one of the most confusing statements I have
ever seen with reference to the use of the term “business statistics.” Inasmuch
as the term vital statistics is used for statistical methods applied to vital facts
and social statistics for methods applied to social facts, I fail to see what there is
“misleading” about calling methods applied to business facts, “business statis-
tics,” except that unfortunately the word statistics has been used to indicate a
science or method as well as a body of facts (for which I am not responsible). 
It might be interesting to Mr. Richter to read the comment of an actuary, an 
experienced insurance statistician and professor of statistics in one of our large 
universities in a published review of this book: “It is, as its name implies, an 
elementary text covering the application of statistical methods to business and 
economic problems.” I do not give the citation because I do not care to lean 
upon the support of his name and reputation, but he seems to have grasped with 
comparative ease the idea which eludes Mr. Richter. I will pass over the latter’s 
naive conception of philosophy as represented by Emerson, Santayana and James 
H. Robinson. 

Another unfortunate feature of Mr. Richter’s review is that he has, in spirit, 
misrepresented one of my statements on part of the work of the Harvard statis-
ticians. He solicits support by stating that the “Harvard University Committee
on Economic Research must have been interested in reading the description of 
what Mr. Riegel calls ‘the so-called “Harvard barometer.”’” The word 
“Barometer” in this statement the reviewer very cleverly decapitalizes, to aid in 
the impression that I was adversely reflecting upon the value of this contribution. 
In fact I used the word “so-called” here merely because the popular title “Harvard
Barometer” is not the official title of this business index. Mr. Richter
must have been aware of my real object because just seven lines further on I 
state that “. . . the index is very valuable as an accurate picture of what has
happened and is happening.” I object very much to the reviewer building a 
real animadversion upon one purely assumed for his own purposes. 

In the same paragraph the reviewer states that “Professor Persons would
hardly relate any production curve to normal which was not corrected for 
seasonal movement.” This comment refers to a chart, on which the seasonal 
movement is plainly shown, and furthermore the next sentence below the chart 
is “The second step is to make allowance for seasonal fluctuations.” Is this
deliberate misrepresentation or not? 

There is one point in this review which is well taken and which I freely
acknowledge indicates a bad blunder in statistical theory, that with reference to
the logarithmic chart, and it shall be corrected at once. 

Other points to which Mr. Richter refers are: (1) a mistake in the middle 
initial of a person’s name; (2) a mistake in wording in discussing figures (the 
figures being approximately correct); (3) too great brevity in an illustration, 
which might cause the illustration to mislead; (4) the error of printing in a table 
of squares 100 00 000, instead of 100 00 00, as the square of 1,000; and finally 
(5) that the author has included in a list of requisites for graphic presentation a 
pencil and has not mentioned an eraser! These are hardly worth the space in 
the JOURNAL he has devoted to them.

The first printing of this book contains numerous typographical errors and 
probably some others. This hardly justifies a reviewer in writing such a con- 
stricted and inadequate “review” as Mr. Richter has produced, which (1) Ut- 
terly fails to appraise the manner of presentation; (2) Omits any evaluation of 
the clarity of exposition; (3) Absolutely omits any criticism of the scope of the 
contents; (4) Mentions nothing as to suitability for the purpose of readableness. 
These are important characteristics in an elementary text and if Mr. Richter had 
considered them his “review” might have been useful as well as humorous. I
might quote some favorable reviews by eminent statisticians but this would be 
taking advantage of their generosity to make up for Mr. Richter’s deficit. 

Finally, one may judge of the spirit in which this review was written when 
informed that the author in his preface stated that “little originality can be 
claimed for the contents except in the method of presentation. The works of 
others have been used unhesitatingly. ...” The reviewer disdained the 
author’s own words and paraphrased this to read “the book . . . may be ex- 
cused for being merely a warming over of the creative work of others.” This 
reminds one of the patient who showed a tender spot to a physician, who promptly
pinched it to demonstrate to the patient the sensitiveness of exposed nerves. 

Sincerely yours, 
ROBERT RIEGEL


A REJOINDER 


In answering Professor Riegel’s criticism of my review, I shall, like him, pro- 
ceed from the specific to the general. 

It so happens that shortly after my review of Mr. Riegel’s book was written, 
Professor F. C. Mills published his book, Statistical Methods Applied to Economics 
and Business. That is my idea of a perfect title. In the light of it, I need hardly 
enlarge further on the reasons for my criticism of the title of Mr. Riegel’s book. 
My allusion to Emerson, Santayana, and Robinson, by the way, was made be- 
cause I had the naive notion that they were masters of the English language, not 
because of any naive conceptions in regard to their philosophy. Readers of my 
review will recall that I was suggesting that in their works might be found excel- 
lent illustrations for a grammar. 

I entirely disclaim any desire to “start” anything between the institutions on
the banks of the Schuylkill and the Charles. I never for a moment supposed that
Mr. Riegel was adversely reflecting on the value of the Harvard Barometer (small
or large “b”, as you will; Mr. Riegel uses a capital “b” on page 497 and a small
“b” on page 498). Neither did I cleverly decapitalize or consciously decapitalize
the word barometer. It just so happens that in my work in business statistics 
over the last five years I do not recall hearing the Harvard index chart called the 
Harvard barometer, and in the paragraph under criticism I was simply calling 
attention to the fact that the author believed (and he apparently still believes, or 
he ought to have seen the point of my criticism, and in another place he acknowledges
only “one point in this review which is well taken”) that “business is
represented by a figure calculated from (1) the volume of pig-iron production and
bank clearings outside of New York and (2) the prices of commodities as shown
by two well-known index numbers of wholesale prices.” This is an incorrect
description of the Harvard “B” curve. When “the works of others have been
used unhesitatingly” by an author writing in the present tense, the least he can
do for those others is not to misinform third parties as to the content of the works 
that have been used. That was and still is my criticism. I leave it to the Har- 
vard Committee to say whether it was well taken. 

As for the next “deliberate misrepresentation” (italics are Mr. Riegel’s): I
can only call attention once more to Figure 120 on page 500, where a legend, referring
to a heavy line, says “Figures Corrected for Secular Trend.” In the text
above the chart, in a reference to the chart, the author says, “Thus we see from
the diagram (italics are now mine) that production exceeded normal in 1905- 
1907. . . .” On page 501 occurs this: “If we deduct the monthly variations 
from the previous results (indicated by the heavy line on Figure 120) we obtain 
the cyclical movements corrected for seasonal variations, that is, the cycles of 
pig-iron production with trend and seasonal movement eliminated. These are 
shown on Figure 121.” (Italics are again mine.) Figure 121 it is which shows 
deviations from normal and not Figure 120, and I repeat that “Professor Persons
would hardly relate any production curve to normal which was not corrected for 
seasonal movement.” Did I deliberately misrepresent Mr. Riegel, or did Mr. 
Riegel once more misinform readers about some of the Harvard work? Again I 
leave the decision with the Harvard people. 

I failed, in Mr. Riegel’s opinion, to appraise his manner of presentation, evalu- 
ate his quality of exposition, criticise the scope of the book’s contents, or mention 
anything as to the book’s suitability or readableness. True. I also failed to 
mention every error in the book; as I intimated, I simply gave a sample. If
Harold Bell Wright or Dr. Frank Crane should write a book in which some in- 
correct statistics were to be found, one would not bother to evaluate their work 
merely on the basis of some statistical errors. Accuracy, however, is certainly 
one of the elementary fundamentals of business statistics or any other kind. 
When, therefore, a professional statistician of the highest academic rank produces 
a book with as much inaccuracy and misinformation and with as poor a presenta- 
tion in some of its parts, and so little clarity in other parts (see the first passage 
quoted in small type in my review) as the one under review, other statisticians,
who recognize a fine piece of work in Mr. Mills’s recent book, can perhaps be ex- 
cused for feeling that Mr. Riegel’s book suffers seriously in comparison. 
F. E. RICHTER
New York City 


PROGRESS OF WORK IN THE CENSUS BUREAU 
CENSUS OF AGRICULTURE 

The editing of the approximately 6,372,000 farm schedules was completed about 
the first of August. The punching of the five cards constituting the primary or 
major tabulation (about two-thirds of the total) was completed early in Septem- 
ber, and the punching of the three remaining cards is well advanced. 

Preliminary press announcements giving totals based on the results of the 
primary tabulation, have been issued for all states and counties in the United 
States. A bulletin giving the preliminary number of farms in the United 
States, by states and counties, for 1925, with comparative figures for 1920, 1910, 
and 1900, was issued in November. This showed a decrease of 75,735 farms, or 
1.2 per cent, since 1920. It is expected to begin the publication of the state 
bulletins in January, 1926, the preparation of the tables for these bulletins being 
well under way. 

BIENNIAL CENSUS OF MANUFACTURES 

Statistics for approximately 150 of the more important manufacturing indus- 
tries canvassed at the census for 1923 are being issued in 57 printed bulletins, 
each presenting statistics for a certain industry or a few related industries. 
Another bulletin will give summary statistics for all industries. Of these 58 bulle- 
tins, 56 have been published and the remaining 2 are in the hands of the printer. 
All the separate bulletins will later become sections of the complete and final 
report of the census of manufactures, 1923. 

Summary reports have been prepared for the 193 cities whose chambers of 
commerce, boards of trade, or similar organizations coöperated with the Bureau
of the Census in collecting the data. These summaries, which are sent in manuscript
form to the coöperating agencies, present, for all industries combined and
for each important industry separately, comparative statistics for the census 
years 1923 and 1921. The Bureau does not itself publish or distribute these city 
summaries but permits the chambers of commerce or other local organizations to 
announce the figures in any way that they may see fit. 

Considerable work, especially in connection with the redrafting of the schedules 
of inquiry, has been done in preparation for the next biennial census, which will 
cover the calendar year 1925 and will begin promptly after the close of the year. 


RELIGIOUS BODIES 


Plans are being perfected for the next census of religious bodies to be taken in 
1926. A tentative multigraphed schedule, together with a letter inviting criti- 
cism and suggestions, has been sent out to prominent representatives of the 
various denominations. The proposed schedule—which is to be filled out for
each individual church—covers number of church members by sex, number and 
value of church edifices, church debt, expenditures, number of Sunday schools, 
and of Sunday school scholars, number of pastors and assistants, and number 
of other churches, if any, served by the pastor or his assistants. 

As indicated by the last religious census relating to the year 1916, there are 
about 200 religious denominations or sects to be canvassed and over 225,000 
churches or church organizations. 


STATISTICS ON THE BUILDING AND LOAN ASSOCIATIONS 


The United States League of Local Building and Loan Associations passed the 
following resolution at its convention in Kansas City in June, 1925: 

“WHEREAS, twenty-five State Building and Loan Leagues have endorsed a proposed
intercensal survey by the Bureau of the Census, of the United States Department
of Commerce, covering building and loan associations of the United States; now,
therefore, be it
“Resolved, That the United States League of Local Building and Loan Associations
go on record as advising the said Bureau and Department of their earnest desire that
such survey be made, and pledging coöperation in making it a success.”

A tentative schedule was prepared after consultation with Mr. C. Clinton 
James, President of the United States League of Local Building and Loan Associa- 
tions, Dr. Horace F. Clark, of Iowa State College, Ames, Iowa, and Mr. Frank A. 
Chase, of the American Savings, Building and Loan Institute, Kansas City, 
Missouri, and mailed to the secretaries of the State Building and Loan Leagues 
and to about 100 associations with the request that they report whether or not it 
would be a comparatively easy matter to supply the data desired. Up to the 
present time no decision has been reached regarding this proposal. 


VITAL STATISTICS 


After an absence of eight weeks, Dr. Davis, Chief Statistician for Vital Statis- 
tics, has returned to Washington from the tour of medical statisticians who 
visited statistical offices in Copenhagen, Stockholm, Oslo, Edinburgh, London, 
The Hague, Berne, and Geneva. This tour was conducted under the auspices of 
the Health Section of the League of Nations, the object being to study the 
methods of compiling and classifying statistical data in various countries with a 
view to harmonizing differences and promoting the international comparability of 
the statistics. 

CHANGES IN PERSONNEL 


Mr. Eugene F. Hartley has resigned the position of Chief Statistician in the 
Bureau of the Census to accept a position in New York as statistician in the 
International Business Machine Corporation. Mr. Hartley has been connected 
with the Bureau of the Census since 1900 and has been Chief Statistician for 
Manufactures since 1917. 

The vacancy in the position of Chief Statistician for Manufactures, created by 
the resignation of Mr. Hartley, has been filled by the appointment of Mr. LeVerne 
Beales. 
Mr. Beales has been employed in the Bureau of the Census since July, 1909.
He was at one time secretary to Dr. Durand when the latter was Director of the 
Census. He has since then been employed on statistical or editorial work in 
various divisions of the Bureau, has written the text for several of the census 
reports, and assisted in the preparation of the monograph on Increase of Popula- 
tion. Since 1923 he has been closely associated with the work of the biennial 
census of manufacture, acting as editor and as statistical assistant to the Chief 
Statistician. 

Mr. George B. Weitzel, who was the representative of the Bureau of the Cen- 
sus during the taking of the recent census in Detroit, has resigned his position in 
this Bureau to accept a position with the Attendance Department, Board of 
Education, of that city. 

J. A. H. 


THE SECOND INTERNATIONAL CONFERENCE OF LABOUR 
STATISTICIANS AT GENEVA, APRIL 20TH TO 25TH, 1925
RESOLUTIONS ADOPTED BY THE CONFERENCE 


I. COST-OF-LIVING INDEX NUMBERS


(intended to measure changes in the cost of living in a given country at different 
periods) 


(1) In each country statistics should be published showing changes in the cost 
of living. They should cover the food, clothing, heating and lighting, rent, and 
miscellaneous groups. Each group should include the most important items of 
consumption, and especially those articles the price changes of which may be 
taken as representative of changes in the prices of articles not included. 

Separate index numbers should be published for each of the groups and general 
index numbers for all the groups combined. In calculating these index numbers 
weights should be used proportionate to the relative importance in consumption 
of the items in each group and of the different groups. Changes in the cost-of- 
living in a given period should be calculated by using weights which remain un- 
changed throughout the period. When changes in consumption render the 
weights inappropriate a new series of index numbers based on new weights should 
be calculated. 
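
The weighting rule in this resolution can be illustrated with a small calculation. The sketch below is not part of the Conference text: the group weights and price levels are hypothetical figures, and the function name `fixed_weight_index` is an assumption made for illustration. It computes a general index as a weighted average of group price relatives, with the weights held unchanged between the base date and the current date.

```python
def fixed_weight_index(base_prices, current_prices, weights):
    """General index (base = 100) as a weighted average of price relatives.

    The same weights are applied at every date, as the resolution asks.
    """
    total_weight = sum(weights.values())
    weighted_sum = sum(
        weights[group] * current_prices[group] / base_prices[group]
        for group in weights
    )
    return 100.0 * weighted_sum / total_weight


# Hypothetical weights (shares of working-class expenditure) for the five groups.
weights = {
    "food": 40,
    "clothing": 15,
    "heating_lighting": 10,
    "rent": 20,
    "miscellaneous": 15,
}
# Hypothetical group price levels at the base date and at a later date.
base = {group: 100.0 for group in weights}
current = {
    "food": 110.0,
    "clothing": 105.0,
    "heating_lighting": 100.0,
    "rent": 120.0,
    "miscellaneous": 100.0,
}

general_index = fixed_weight_index(base, current, weights)
print(round(general_index, 2))
```

When consumption habits change enough to make the weights inappropriate, the resolution calls for a new series on new weights, not an adjustment of the weights within the existing series.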

It would be desirable that as far as possible the information published should 
enable the differences which would result in the index numbers from the inclusion 
or exclusion of direct taxes to be determined. 

(2) A series of national index numbers, calculated by using weights based on 
the consumption of industrial workers generally should be published. In cases 
where desirable, index numbers may also be calculated for other classes of 
workers. 

In countries in which there are marked differences in economic conditions in 
different districts or among different sections of the working classes, it is desirable 
that separate index numbers should be published for such areas and classes. 
In these cases the weights used should, as far as possible, be based on working-class
consumption in the districts or among the sections of workers to which the
index numbers relate. 

It may be convenient that the national index should be computed by combin- 
ing the separate indices, account being taken of the relative populations in the 
different groups. 
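
As a hedged illustration of this combination, the sketch below averages district indices with weights proportional to population. The district names, index values, and populations are hypothetical, and `combine_indices` is a name chosen here, not one used by the Conference.

```python
def combine_indices(indices, populations):
    """National index as the population-weighted mean of the separate indices."""
    total_population = sum(populations[name] for name in indices)
    return sum(indices[name] * populations[name] for name in indices) / total_population


# Hypothetical district indices (base = 100) and populations in thousands.
district_indices = {"north": 110.0, "south": 120.0}
district_populations = {"north": 3000, "south": 1000}

national_index = combine_indices(district_indices, district_populations)
print(national_index)
```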

(3) In view of the existing inadequacy of statistics of consumption, the most 
satisfactory method of determining the weights of the different items and 
groups appears to be that based on a standard family budget obtained as the 
result of an enquiry into the average expenditure of a number of families during 
a given period. 

Where statistics of family budgets are not available statistics of aggregate 
consumption may be of some value in the determination of the importance of 
certain articles of consumption in the country as a whole, account being taken 
of the fact that certain articles are used for purposes other than human con- 
sumption. 

Theoretical budgets computed from all information available may also be 
used. 

(4) Care should be taken to ensure that the quality of each item for which 
prices are obtained at different dates and the methods of collection and calcula- 
tion remain unchanged during the period covered. 

(5) Index numbers should be computed as often as may be necessary having 
regard to the extent to which prices are changing. In present conditions they 
may with advantage be computed at monthly intervals, but under more stable 
conditions their publication at less frequent intervals may be adequate. 

(6) It is desirable that in those countries in which no family budget enquiries 
have been held since 1920-1921 such enquiries should be undertaken as soon as 
economic conditions are sufficiently favourable, and, if possible, not later than 
the year 1928. 

(7) It is desirable that the governments, in collaboration with the Interna- 
tional Labour Office, should agree to adopt the same year as base for the calcula- 
tion of a new series of index numbers. It is suggested that the year 1930 would 
be suitable for this purpose. 

(8) It is of great importance that the principles adopted in the selection of 
articles and of the different qualities of such articles, in the collection and com- 
putation of the price data and in the fixing of the weights, should be indicated 
in full detail when such index numbers are first published, also the precise data
to which the index numbers refer. 


II. UNEMPLOYMENT STATISTICS 


(1) In countries in which a widespread system of unemployment insurance 
exists the information obtained from the working of such a system forms the 
best basis for unemployment statistics. 

(2) These statistics should furnish the following information as a minimum: 

(a) Annually, the total number of workpeople insured against unemployment,
which should be related so far as practicable to the total number of
workers.

(b) Monthly, the total number of unemployed on a given day in receipt of 
benefit, and the total number of insured workers unemployed (whether on 
benefit or not) on the same day. 

(c) Monthly, the percentage that on a given day the total number of insured 
unemployed (whether on benefit or not) forms of the estimated number of 
insured workpeople. 

(d) Annually, the total amount paid during the year in benefits. 

A method according to which, as in the Netherlands, a percentage is calculated 
between the number of days of unemployment during one week and the total 
number of days of work which might have been performed is one which is worthy 
of attention. 
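
The two percentages described in this resolution may be sketched as follows. The figures are hypothetical, the function names are assumptions, and the six-day working week used for the day-based measure is an assumption of this sketch, not something the resolution specifies.

```python
def insured_unemployment_percentage(insured_unemployed, insured_total):
    """Item (c): insured unemployed on a given day as a percentage of insured workpeople."""
    return 100.0 * insured_unemployed / insured_total


def days_lost_percentage(days_unemployed, workers, working_days_per_week=6):
    """Day-based measure: days of unemployment in a week as a percentage
    of the days of work that might have been performed (assumed 6 per worker)."""
    possible_days = workers * working_days_per_week
    return 100.0 * days_unemployed / possible_days


# Hypothetical returns for one reporting week.
rate_on_given_day = insured_unemployment_percentage(50, 1000)
rate_by_days = days_lost_percentage(300, 1000)
print(rate_on_given_day, rate_by_days)
```

The day-based measure counts partial weeks of unemployment, which a count of persons on a single day cannot do.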

(3) Where statistics based on unemployment insurance, compulsory or 
voluntary, are not available, it is desirable to obtain from workers’ organisations 
the following information: 

(a) Monthly, the total number of unemployed on a given day and the 
percentage they form of the total membership covered by the enquiry. 

(b) Annually, the number of workers covered by the enquiry as a percentage 
of the total number of workers in the corresponding industries or occupations. 

Even when statistics based on unemployment insurance become available it is
desirable, for purposes of comparison, to continue trade union statistics as long
as they are reliable.

(4) The statistics derived from public employment offices should give: 

(a) The number of workpeople registered on a given day of the month as 
seeking work, with the total number of vacancies remaining unfilled on the 
same day. 

(b) The number of workers’ applications registered, of vacancies notified,
and of vacancies filled, during the month. 

(5) Employment exchange statistics should be compiled so that as far as 
possible unskilled workers are distinguished from other classes of workers. 

(6) Information as to the state of employment should also be published 
periodically, if possible monthly, based on returns made by a representative 
number of employers. 

(7) In countries in which satisfactory information concerning unemployment 
cannot be obtained by the means indicated above, it is desirable that an attempt 
should be made to obtain at the general population census, or at an industrial
or occupational census, information as to the amount of unemployment, or that
special enquiries relating to the whole population or to an adequate sample 
thereof should be made from time to time with a view to ascertaining the number 
and condition of the unemployed. 

(8) The statistics indicated in the preceding resolutions should distinguish 
males and females and should, so far as possible, give figures for separate occupations
in the case of statistics arising out of the activities of employment exchanges,
and for separate occupations or industries as may be most convenient in the
case of trade union and insurance statistics and in the case of special industrial
or occupational enquiries. The classifications used should be based on the
classifications adopted in the general population census in so far as these are 
applicable to employment exchange operations. 

(9) It is desirable, for comparing statistics of unemployment internationally:

(a) That precise and detailed information should be published and kept 
up to date as to the methods adopted in compiling the various statistics of 
unemployment; in particular, each country should indicate any legislative 
or administrative changes affecting the value of its unemployment statistics; 

(b) That copies of all forms and questionnaires used in collecting the 
various statistics should be forwarded to the International Labour Office; 

(c) That in each country an enquiry should be made to determine, as 
exactly as possible, the representative value of the unemployment statistics 
in relation to the “ideal statistics” which would give, at any date, the total
number of unemployed in relation to the total number of workers. For this 
purpose it is agreed: 

(i) that the ideal population “field” to which the statistics should relate
should be all persons whose normal means of livelihood is employment 
under contract of service as well as those persons not hitherto wage earners 
who seek to become so; 

(ii) that the unemployment measured should exclude that due to sick- 
ness, invalidity, participation in trade disputes or voluntary absence from 
work, and should be limited to unemployment due to lack of employment 
or to lack of work while in employment; 

(iii) that the necessary and sufficing condition for being enumerated as 
unemployed is that the individual must have been not at work for one day 
at least. 

(10) It is desirable that the different statistics of unemployment (insurance, 
trade union, and employment exchanges) should be presented together, pref- 
erably in graphic form, so that they might be co-ordinated and checked one 
with another and as clear and correct an idea as possible of the fluctuations in 
unemployment obtained. 

(11) Statistics of short-time employment should, if possible, be given 
separately from those of whole-time unemployment. 


III. INTERNATIONAL COMPARISONS OF REAL WAGES 


The Conference recognises the great theoretical and practical importance of 
the international comparisons of real wages which were initiated by the British 
Ministry of Labour and the interest of the researches which have been subse- 
quently carried on by the International Labour Office and which have certainly 
led to progress in this sphere. 

The Conference, however, expresses the opinion that, since these figures can- 
not be considered as representative of the real differences in the workers’ standard 
of living which exist from country to country, it is desirable that a second series 
of index numbers should be published along with the first, measuring the relative 
standards of living of the working classes in the different countries. In this 
case it would be necessary to base the calculations on the actual earnings of 
working-class families and to take account of the differences in the physiological
needs of the population due chiefly to climate and race. 

The International Labour Office, in publishing these index numbers, should 
give in the greatest detail the original figures, the methods of calculation adopted, 
and any reservations necessary. The different countries should send to the 
International Labour Office the fullest information, together with a detailed 
description as to the nature and value of their figures. 


IV. CLASSIFICATION OF INDUSTRIES 


The Conference, realising, as it did in 1923, that it is not possible at the 
present time to reach an agreement as to the scientific bases of an international 
classification of industries, but recognising that it is necessary to ensure in the 
immediate future some form of international comparison, at least for the prin- 
cipal industries, recommends that pending the time when a complete classifica- 
tion becomes possible there should be drawn up, after consultation with the 
various statistical services and international organisations concerned, a provi- 
sional list of the most important industries. 

In order that the question of drawing up a more complete list of industries 
which would serve as a base for international statistical comparisons might be 
thoroughly investigated, the Conference recommends the International Labour 
Office to invite the Governments to examine the draft classification which has 
been submitted to them, and to inform the Office of the difficulties which they 
would experience in adopting it, calling attention to cases where the draft list 
might be reduced or expanded. 


MISCELLANEOUS NOTES 


Meeting on Ways of Using Recorded Data to Estimate Future Trends.—A Din- 
ner Meeting of the American Statistical Association was held on Friday evening, 
September 25, 1925, at the Aldine Club in New York City. One hundred and forty- 
five persons were present. President Robert E. Chaddock was in the chair. 

The first paper, entitled The Use of Recorded Data to Verify Future Trends, was 
presented by Leroy E. Peabody, Associate Highway Economist of the United States 
Bureau of Public Roads, Washington, D.C. He discussed the characteristics of pos- 
sible ways of calculating trends and showed that the best fitting curve (as for example 
a cubic parabola when applied to business failures) may give a forecast that seems 
decidedly unreasonable on the face of it, and is later shown by the facts to be highly 
erroneous. 

He pointed out further, that, in many cases, if a problem is broken down into its 
component parts, it becomes very much easier to arrive at a reasonable basis for deter- 
mining the trend. For example, the population factor is relatively easy to forecast, 
and this can be taken out of the trend of motor vehicle registration, leaving as a resi- 
due cars per capita, a quantity concerning which it is feasible to form fairly definite 
conclusions, such as that we are not likely to use more than a given number of cars per 
family. Similarly the ton mileage of railway freight may be reduced to such compo- 
nents as population and tons per capita or tons loaded and average distance of haul. 
It is well thus to analyze the problem in several ways and then to compare the final 
results obtained by the various methods.
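
The decomposition of motor vehicle registration into population and cars per capita can be sketched in a few lines. Everything in the example is hypothetical: the population figures, the per capita figures, the ceiling, and the function name are illustrative assumptions and are not taken from Mr. Peabody's paper.

```python
def forecast_registrations(population_forecast, cars_per_capita_forecast, ceiling):
    """Recombine two component forecasts into a registration forecast.

    Cars per capita is capped at `ceiling`, reflecting the judgment that a
    family is not likely to use more than a given number of cars.
    """
    return [
        population * min(per_capita, ceiling)
        for population, per_capita in zip(population_forecast, cars_per_capita_forecast)
    ]


# Hypothetical forecasts: population in millions, cars per person.
population = [100.0, 110.0]
cars_per_capita = [0.20, 0.30]   # the second value exceeds the assumed ceiling

registrations = forecast_registrations(population, cars_per_capita, ceiling=0.25)
print(registrations)
```

The point of the decomposition is that each component can be judged for reasonableness separately, which a curve fitted to total registrations does not allow.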

Another procedure that may be used in verifying the trend is to divide the data into
sections and, after producing separately the trends of the respective parts, to combine
these trends into a whole. If the result obtained by this consolidation is similar to the
trend as produced from all the data, one’s confidence in the accuracy of the forecast is, 
of course, greatly increased. 

The next phase of the discussion was given by Dr. Henry Schultz, Director of Sta- 
tistical Research of the United States Children’s Bureau, Washington, D. C.; the sub- 
ject of his paper was Empirical Trends and the Law of Demand. He began by endors- 
ing the views of E. C. Rhodes, who holds that, in smoothing a curve, it is necessary to 
compromise between two principles which are necessarily conflicting. The two prin- 
ciples are: 

1. We must obtain a curve without the sinuosities which are not inherent in 
the data. 

2. We must get a curve which “fits” by least squares.

Somehow, by careful balancing of these two mutually destructive principles, 
we can obtain for any data a curve which is reasonably smooth and which at the 
same time gives a series of errors which have a reasonably small mean square 
error. The smoothness we obtain is not the best, nor is the sum of the squares 
a minimum. 


Dr. Schultz pointed out that, in most cases, 


it is impossible to get an objective test of the wisdom of one’s compromise be- 
tween smoothness and goodness of fit. Where, however, we are dealing with 
such theoretically correlated series as production and prices, or consumption and 
prices, for a period of years, we may, at least, get a rough check on the suitability
of the compromise curve for the problem in hand.


This principle was illustrated by obtaining demand and price curves for sugar and 
then taking the deviations of each of these quantities from their respective trends. 
The trends were computed by a number of different methods. The conclusion was 


that the use of comparatively simple curves—such as would be selected by most 
statisticians as being the best compromise between smoothness and goodness of 
fit—yields better (a priori) results than those obtained through the use of more 
complex curves. The highest correlation (R=0.78) between the trend ratios 
is obtained when the cubic is taken as the trend of prices as well as consumption. 
The degree of correlation diminishes as the sensitiveness of the price trend is 
increased, until, as in the case when the Rhodes curve is used, the very existence 
of correlation becomes somewhat doubtful. The confidence which statisticians 
are wont to place in comparatively simple curves would thus appear not to be 
misplaced. 
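
The check described here, correlating the ratios of two series to their respective trends, may be sketched as follows. This is a simplified illustration and not Dr. Schultz's computation: it assumes a straight-line trend where he compared cubics and more sensitive curves, and the consumption and price series are artificial figures constructed so that the two move in opposite directions about their trends.

```python
import math


def linear_trend(values):
    """Least-squares straight line through equally spaced observations."""
    n = len(values)
    t_mean = (n - 1) / 2
    v_mean = sum(values) / n
    stt = sum((t - t_mean) ** 2 for t in range(n))
    stv = sum((t - t_mean) * (v - v_mean) for t, v in enumerate(values))
    slope = stv / stt
    return [v_mean + slope * (t - t_mean) for t in range(n)]


def trend_ratios(values):
    """Ratio of each observation to its fitted trend value."""
    return [v / trend for v, trend in zip(values, linear_trend(values))]


def correlation(xs, ys):
    """Pearson coefficient of correlation."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Artificial series: consumption is above trend in years when price is below it.
signs = [1, -1, 1, -1, 1, -1]
consumption = [(100 + 2 * t) * (1 + 0.05 * s) for t, s in enumerate(signs)]
prices = [(10 + 0.1 * t) * (1 - 0.04 * s) for t, s in enumerate(signs)]

r = correlation(trend_ratios(consumption), trend_ratios(prices))
print(round(r, 3))
```

A trend curve sensitive enough to follow the year-to-year movements would absorb the very deviations being correlated, which is the reason given above for the correlation diminishing as the price trend is made more sensitive.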


Dr. Schultz also emphasized the fact that


the common method of fitting a straight line to data involves the arbitrary 
selection of one of the variables as the independent variable X, and the assump- 
tion that an observed point fails to fall on the line because of an error or de- 
viation in the dependent variable Y alone, the X variable being allowed no 
deviation. The result of this method is that we get one straight line if we treat 
one of the variables as an independent variable, and quite a different one if we 
treat the other variable as the independent variable. 


The fact is, however, that 


in practically all cases of observed data, relating to demand and supply, the X 
variable is as subject to errors or deviations as the Y variable. We know, for 
example, that the statistics of sugar consumption are as subject to error as are 
the statistics of sugar prices, if not more so. It therefore appears that a better 
fitting straight line will be obtained by assuming that an observed point fails 
to fall on the line, because errors are present in both X and Y. The curve fitting 
method which is based on this assumption is much more complicated than the 
ordinary method. To the best of my knowledge, the direct application has 
thus far been confined to the simplest of all curves—the straight line. .. . 
When this ratio is equated to unity, that is to say, when the observations in X 
are given the same weight as those in Y, or when the two variables are assumed 
to be equally accurate or equally inaccurate, this method has a simple geomet- 
rical interpretation: it makes the sum of the squares of the normals—perpen- 
dicular distances of the points from the curve—a minimum. . . .[The line thus 
obtained] may, therefore, be looked upon as the line of best fit. Within the 
limits of observation it is the best demand curve that may be derived from the 
data at hand. This line yields a coefficient of elasticity of -0.506; that is to say, 
a change in the price of sugar of 1 per cent will, under normal conditions, affect 
the consumption by only one-half of one per cent. 
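
The contrast drawn above between the ordinary fit and the line that minimizes the squared perpendicular distances may be sketched as follows. This is an illustrative Python sketch only, not Dr. Schultz's own computation: the function names and the sample figures are assumptions introduced here, and only the equal-weight case (the ratio equated to unity) is shown.

```python
import math

def centered_sums(xs, ys):
    """Return the centered sums of squares and products (sxx, syy, sxy)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxx, syy, sxy

def slope_y_on_x(xs, ys):
    # Ordinary least squares: all deviation is assumed to lie in Y.
    sxx, _, sxy = centered_sums(xs, ys)
    return sxy / sxx

def slope_x_on_y(xs, ys):
    # All deviation is assumed to lie in X; the slope is re-expressed as dY/dX.
    _, syy, sxy = centered_sums(xs, ys)
    return syy / sxy

def slope_orthogonal(xs, ys):
    # Equal weight on X and Y: minimizes the sum of squared perpendicular distances.
    sxx, syy, sxy = centered_sums(xs, ys)
    d = syy - sxx
    return (d + math.sqrt(d * d + 4.0 * sxy * sxy)) / (2.0 * sxy)

# Hypothetical figures, for illustration only (not the sugar data).
quantity = [1.0, 2.0, 3.0, 4.0, 5.0]
price = [5.1, 3.9, 3.2, 1.8, 1.0]
print(slope_y_on_x(quantity, price))
print(slope_orthogonal(quantity, price))
print(slope_x_on_y(quantity, price))
```

The two ordinary fits give different lines, and the perpendicular-distance slope falls between them. Interchanging the variables merely inverts the perpendicular-distance slope, so no arbitrary choice of independent variable is involved.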





The speaker stated that the application of the same method to the ratio as derived 
from the other trends, gave practically the same values for the coefficient of elasticity 
of demand. 

In the discussion which followed later, Dr. Schultz expressed the view that the 
methods of curve fitting which he had used were more satisfactory for interpolation 
than for extrapolation. 

The third speaker on the program was Mr. Ray B. Prescott, of New York City, 
Consulting Statistician, who spoke on the subject, The Practical Treatment of Secular 
Trends. He emphasized the fact that if, in the calculation of trends, one is to obtain 
results worthy of confidence, complete familiarity with the nature of the data is es- 
sential. In his view, it is out of the question to apply the same method to all classes of 
data, but he stated that his experience with production data shows that the use of the 
Gompertz curve commonly gives very satisfactory results if the data cover a period of 
sufficient length. Then a forecast on this basis may be carried several years into the 
future with comparative safety if proper judgment is used, for, in many cases, the 
actual figures, when obtained later, show the errors to be surprisingly small. 

Mr. Prescott also called attention to a device which he had found very helpful, for 
testing the reliability of a method of curve fitting, namely, to fit the curve to one half 
of the data and see how well a continuation of the curve fitted the remainder of the 
data. The process may then be reversed by fitting the curve to the other half of the 
data. If the curve is following the law of the data, it evidently should give a satis- 
factory fit no matter which half of the data is used. 
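
Mr. Prescott's device of fitting to one half of the data and checking against the other may be sketched in Python as below. A straight line stands in for the Gompertz curve purely to keep the sketch short; the function names and the use of mean absolute error as the measure of fit are assumptions made here, not part of his description.

```python
def fit_line(ts, ys):
    """Least-squares straight line y = a + b*t; returns (a, b)."""
    n = len(ts)
    mt = sum(ts) / n
    my = sum(ys) / n
    b = (sum((t - mt) * (y - my) for t, y in zip(ts, ys))
         / sum((t - mt) ** 2 for t in ts))
    return my - b * mt, b

def split_half_errors(ts, ys):
    """Fit on each half in turn and return the mean absolute error on the other half."""
    h = len(ts) // 2
    halves = [(slice(0, h), slice(h, None)), (slice(h, None), slice(0, h))]
    errors = []
    for fit_part, check_part in halves:
        a, b = fit_line(ts[fit_part], ys[fit_part])
        resid = [abs(y - (a + b * t))
                 for t, y in zip(ts[check_part], ys[check_part])]
        errors.append(sum(resid) / len(resid))
    return errors

# Hypothetical series: one that follows the fitted law and one that does not.
years = list(range(10))
print(split_half_errors(years, [2 + 3 * t for t in years]))
print(split_half_errors(years, [t * t for t in years]))
```

When the curve follows the law of the data, both errors are small whichever half is used for fitting; when it does not, the continuation departs widely from the remaining figures.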

The speaker believed it to be useless to attempt to fit mathematical curves to data, 
unless records for a long period are available. He pointed out the necessity of having 
the data to which the curve is fitted begin and end at the same phase of the business 
cycle. This precaution is specially important when the series of years covered is brief. 

Dr. Edgar Sydenstricker, Statistician in Charge, United States Public Health Serv- 
ice, Washington, D. C., presented the last of the four papers, his subject being, Time 
Variables in Vital Statistics. He stressed the necessity of the statistician being 
thoroughly familiar with the field to which he attempts to apply statistical technique. 
In his view, the statistician, if he is to avoid the misuse of statistical methods, must 
first become an expert in his chosen line and then learn how to apply statistics to his 
own problems. He felt that the world has very little room for the “pure statistician.” 
Statistics divorced from science is nothing but mathematics. 

Dr. Sydenstricker stated that, until very recently, no serious attempt had been 
made to do forecasting in the field of public health, but, at present, efforts along this 
line are meeting with considerable success. The forecasting of epidemic conditions is 
a matter of great importance, for it enables health officials to prepare, in advance, 
suitable measures of control. 

According to his experience, different diseases, being due to different causes, obey 
different laws, and hence the data pertaining to each different kind of sickness must be 
dealt with by the method appropriate to that particular disease. 

It is always necessary, in dealing with mortality or morbidity statistics, to dis- 
tinguish carefully between trends, cycles, seasonal fluctuations, and the sharp varia- 
tions caused now and then by epidemics. 

In discussing the best method of locating trends, he stated that his experience in- 
dicated: 

A. That the determination of specific variables is more accurately and easily 
accomplished by graphical methods, than by mathematical, unless the 
data are sufficient in quantity to warrant the hypothesis of a law. 

B. That even when mathematical generalization is possible and advisable, it 
can be applied to great advantage only after the nature of the variables 
is ascertained by other means. 

Dr. Sydenstricker also mentioned the fact that considerable success has been ob- 
tained in predicting the duration of an existing epidemic wave, by studying the shape 
of the curve recording the number of cases reported each day. He felt that the possi- 
bilities for forecasting in the field of public health are large, but that progress in this 
line can be made only through careful and painstaking study. 

The discussion was begun by Dr. Frederick R. Macaulay, Statistician of the Na- 
tional Bureau of Economic Research, New York City. He called attention to the fact 
that many types of curves that are extremely satisfactory for interpolation become 
absurd when extended any distance outside of the confines of the data. This is the 
case with most types of parabolae. Mathematical formulae suitable for extrapolation 
must be of such a nature as to give results that seem reasonable when extended beyond 
the limits of the data. 

In interpolation, the Gompertz curve is, in many instances, extremely satisfactory, 
but, in other instances, fits the data quite badly. An example of the latter case is the 
application of the curve to the production of pig iron. In the case of pig iron, a much 
better fit is obtained by fitting a Gompertz curve to the logarithms of the data, rather 
than to the original figures. 

The discussion was continued by Professor Theodore H. Brown of the Bureau of 
Business Research, Harvard University, Cambridge, Mass. Professor Brown pointed 
out that the results gained by fitting the trend to logarithmic data might be very dif- 
ferent from those obtained by fitting it to natural data. He called attention to the 
fact that the use of logarithms presupposes that the variable changes each year by a 
constant percentage which, in many instances, is far from true. Very commonly, 
therefore, other treatments are superior to the logarithmic. 
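
Professor Brown's point may be illustrated for the simplest case, a straight line fitted to the logarithms of the data. The Python sketch below is an assumption-laden illustration (the function names and the sample series are invented here): whatever the data, such a fit yields a single constant percentage change per year.

```python
import math

def fit_log_line(ys):
    """Least-squares line through (t, log y) for t = 0, 1, 2, ...; returns (a, b)."""
    n = len(ys)
    ts = range(n)
    logs = [math.log(y) for y in ys]
    mt = sum(ts) / n
    ml = sum(logs) / n
    b = (sum((t - mt) * (v - ml) for t, v in zip(ts, logs))
         / sum((t - mt) ** 2 for t in ts))
    return ml - b * mt, b

def implied_percent_change(b):
    """Constant percentage change per period implied by the log-slope b."""
    return (math.exp(b) - 1.0) * 100.0

# Hypothetical series growing by a constant amount, not a constant percentage.
series = [100 + 10 * t for t in range(8)]
_, slope = fit_log_line(series)
print(implied_percent_change(slope))
```

The sample series actually grows by 10 per cent in its first year and by 6.25 per cent in its last, yet the logarithmic line reports one intermediate rate for every year, which is the presupposition Professor Brown warned against.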

He emphasized the necessity of selecting just the right method for fitting a trend to 
be used for the purpose of extrapolation, for he stated that he had known many in- 
stances in which a variation in methods resulted in obtaining forecasts highly con- 
tradictory in nature. This diversity of results has caused business men to be sceptical 
of statistical forecasting in general. They are inclined to revert to rough-and-ready 
methods. In making forecasts, they, as a rule, insist on throwing out all years which 
they believe to be abnormal. While this method has much to commend it, it does, of 
course, involve personal judgment and must necessarily be affected by individual bias. 

The discussion was closed by Dr. Arne Fisher of the Western Union Telegraph 
Company. He stated that he had had considerable practice in forecasting and that, 
in his view, the Gram curve gave much better results than any of those mentioned by 
the speakers. Furthermore, his experience indicated that it could be more easily 
fitted to the data than could curves of the type previously discussed during the eve- 
ning. 

The meeting then adjourned. 


Meeting on the Problem of Forecasting City Populations with Special Reference 
to New York City.—A dinner meeting of the American Statistical Association at which 
147 persons were present was held on Friday evening, October 30, 1925, at the Aldine 
Club, 200 Fifth Avenue, New York City. President Robert E. Chaddock introduced 
the presiding officer, Dr. Haven Emerson of Columbia University, Chairman of the 
Board of Directors of the Cities Census Committee. 

The first speaker was Dr. Alfred J. Lotka, Supervisor of Mathematical Research of 
the Statistical Bureau of the Metropolitan Life Insurance Company. His subject was 
The Empirical Element in Population Forecasts. He pointed out the fact that 


we regard as empirical in the narrower sense those relations between objects or 
events, which have been found by experience or trial to hold in certain cases, but 
which have no apparent connection with more general principles. The utility of 
such empirical relations or laws is limited in two directions. First, since we do not 
know the general law underlying the empirical relation, the precise form of this must 
be determined afresh for every instance that arises. Second, after we have deter- 
mined the special form of the relation for a particular case, we shall always feel un- 
safe in extending the application of the law much beyond the range of actual obser- 
vation, since it contains unknown elements which may become dominant factors 
outside this range, and occasion more or less marked departures from the “law” 
outside this range. 

An empirical formula is therefore not so much the solution of a problem as the 
St to such solution. It is a point of interrogation, an animated question 
mark. 


He further showed that the Pearl-Reed formula can be written in such a manner as 
to specify definitely the empirical factors. He said: “The law of Pearl and Reed 
may be written 

$$r = r_0 - r_0 \frac{N}{N_\infty}$$

signifying that the rate of increase $r$ at any time is given by a certain incipient rate of 
increase $r_0$, diminished by a term $r_0 \frac{N}{N_\infty}$ which is proportional to the number $N$ of the 
existing population, expressed as a fraction of the ultimate population $N_\infty$.” In his 
view, this latter constant presumably stands for a complex comprising the death-rate 
and the influence of migration into the city, both from other parts of the United States 
and from foreign countries. This complex is, of course, difficult to analyze; hence it is 
not easy to tell how it will be affected by changes in economic and social conditions. 
As to the other constant $r_0$ which, in the case of the population curve for the United 
States, as fitted by Pearl and Reed, has the value of 3.134 per cent per annum, he con- 
tinued as follows: 


What makes this constant? What gives it this particular value of about 3 per 
cent per annum? 

Seeking an answer to this question we consult present conditions as a key to the 
past. In 1920 the age schedule of maternity frequency (fertility) among white 
married women was that shown in the table presented. Now we may inquire: By 
how much must this 1920 age schedule of maternity be multiplied to give an annual 
rate of increase of .03134? The requisite computations have been carried out on 
two different bases, namely, (1) using the 1920 figures of mortality (life table), 
marital ratio, and maternity frequency; (2) using the oldest reliable English life 
table (1838-1854) and a somewhat augmented (arbitrarily) marital ratio, to corre- 
spond more nearly to conditions prevailing, say, in the eighteenth century. 

On the basis of these figures, and of an age distribution corresponding to geometric 
increase at the incipient rate, .03134, it was found that the 1920 fertility figures must 
be multiplied by 2 according to basis of computation (1), or by 2.3 according to 
basis of computation (2). On computing the interval between births corresponding 
to these factors 2 and 2.3, it is found that in the age group of maximum fertility the 
interval between births comes out as 16 months and 13} months respectively. 
One is thus led to infer that the limiting factor which determines the constant .03134 
and sets the extreme limit to the rate of growth of a population unfettered by popu- 
lation pressure, is the physiological necessity of a certain minimum average interval 
of about one year or a little more between successive births. 
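
The law of Pearl and Reed as Dr. Lotka stated it, a rate of increase equal to an incipient rate diminished in proportion to the existing population, may be sketched numerically as follows. The incipient rate .03134 is the figure quoted above; the ultimate population and the starting population are hypothetical values chosen only for illustration, and the function names are likewise assumptions of this sketch.

```python
import math

R0 = 0.03134           # incipient rate per annum, as quoted for the United States
N_INF = 200_000_000    # hypothetical ultimate population, for illustration only

def growth_rate(n, r0=R0, n_inf=N_INF):
    """Proportional rate of increase r = r0 - r0 * N / N_inf."""
    return r0 - r0 * n / n_inf

def logistic(t, n0, r0=R0, n_inf=N_INF):
    """Population t years after it numbered n0, obtained by integrating dN/dt = r * N."""
    return n_inf / (1.0 + (n_inf - n0) / n0 * math.exp(-r0 * t))

# Hypothetical starting population of ten millions.
for years in (0, 50, 100, 200):
    n = logistic(years, 10_000_000)
    print(years, round(n), growth_rate(n))
```

The rate begins near the incipient value while the population is small and falls toward zero as the population approaches its ultimate size, which is the behavior the two constants are meant to express.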


The next speaker was Dr. Lowell J. Reed, Head of the Department of Biometry and 
Vital Statistics of Johns Hopkins University. The subject of his address was The 
Growth of the Population of New York City in Relation to that of Neighboring States. Dr. 
Reed described the use of the Logistic Curve in forecasting the population of New 
York City and the surrounding region. He showed that, contrary to the contention 
of the critics of the method, it is possible by summating the curves representing the 
components, to arrive at a curve accurately representing the aggregate. A diagram 
which he presented showed that the Logistic Curve fitted the recorded population of 
New York City closely over a long period of time, while the curve derived by Dr. Wil- 
son failed to fit the data, actually crossing at a sharp angle the line representing the 
general tendency of population growth in recent years. Under these circumstances, 
he felt that it was much wiser to put faith in the trend shown by the Logistic Curve 
than in that obtained by the use of Dr. Wilson’s formula. 

The third speaker of the evening was Mr. Ernest P. Goodrich, Consultant on 
Traffic Studies relating to the Regional Plan of New York and Its Environs. The 
title of his paper was A New Estimate of New York City’s Future Population. Mr. 
Goodrich described “the methods followed in preparing the so-called new estimate of 
the size of future New York.” He contended that it was unsafe to estimate the aggre- 
gate by summating the results for the separate parts, the reason being that the sum is 
“usually found to aggregate a greater total than is disclosed when the same methods 
are applied to the total aggregation.” He stated that experience shows the average 
deviation to be larger in small aggregations than in large aggregations of population. 
This being true, it follows that the way to minimize errors is to deal first with the total 
and then with its subdivisions. In forecasting the population of New York City, he 
has followed the plan of first estimating the population of the entire United States and 
later apportioning this total to the different sections. He has found it best to consider 
as a group all cities in the United States which, at any given census date, were large 
enough to contain a specified fraction of the entire population—as for example, one 
per cent. He has further attacked the problem not only by studying the changes in 
the percentages of the population falling in the respective groups of cities or individual 
cities, but also by noting the changes in the rate of increase from decade to decade. 

If the trend of the last few decades in the percentage of growth of the population of 
the United States should continue, the indicated population in 1960 would be 138,000,- 
000. If, however, this trend should curve off toward the horizontal, the population at 
the same date might be estimated at a figure as high as 169,000,000. Mr. Goodrich 
felt that it was probable that the population in 1960 would lie somewhere between 
these limits. 

In planning for the growth of a city, it is wise to make too high rather than too low 
an estimate. By applying to the higher estimate for the nation as a whole, the frac- 
tion representing the anticipated percentage of the total population that will be resid- 
ing in New York City in 1960, it appears that the city proper may be expected to have 
a population of 8,500,000 at that date. A similar method would indicate that it would 
have nearly 12,000,000 inhabitants by the year 2000. The figure for the population 
of the entire New York region estimated in the same manner is approximately 14,000,- 
000 for 1960 and 20,000,000 for the year 2000. Mr. Goodrich emphasized the view 
that, since immigration is being shut off and the birth-rate is falling rather rapidly, 
there is every reason to expect a rate of population growth in the future slower than 
that prevailing in the past. 

The discussion was opened by Dr. O. E. Baker, Economic Analyst of the Bureau of 
Agricultural Economics of the United States Department of Agriculture. He called 
attention to the fact that, in considering the limiting factors of population, it seems 
best to deal with a unit comprising both the United States and Canada. The three 
principal natural resources, lack of any one of which can be depended upon to prevent 
an indefinite expansion of population are coal, iron, and soil. Because of the large 
supplies of coal and iron in this region it appears that scarcity of farm land will check 
population growth before the strain upon the mineral resources mentioned has become 
very intense. There is much arable land in both the United States and Canada that 
can be brought under cultivation if the price of farm produce rises to a high enough 
figure. The intensity of cultivation of the land already under crops may also be 
greatly increased. Furthermore, the diet can be changed by introducing more vege- 
table and less animal food, for it takes much more space to furnish the same number of 
calories of energy in the form of meat than in the form of grain or other vegetable 
products. To produce a ton of beef requires a much larger space than is needed to 
obtain a ton of pork, and to raise a ton of pork requires far more land than to grow a 
ton of grain. After considering all of the known factors, it seems that we might, 
without serious difficulty, double our area under cultivation, increase the average 
crop yield per acre by 50 per cent, and change our diet to something like that existing 
in Germany before the war. Were this done, the country would support 500,000,000 
people. Should we reduce our standard of diet to that of the Japanese, the nation 
might have a population of at least a billion. 

Improvements in methods of production during the last few decades have greatly 
increased the quantity of products turned out by the average farm family, the increase 
being 100 per cent since 1870. As a result, farm population has been almost station- 
ary for some time, even though the total population of the country has been growing 
rapidly and eating as much as ever. There has, however, been some shift from a beef 
to a pork diet. At present, we export about one-eighth of the products of our crop 
land, but the volume of exports is likely to shrink in the future. As time passes, the 
movement from the farm to the city may diminish slightly, but there are apparent no 
rural factors preventing the fulfillment of the conclusions reached either by Mr. 
Goodrich or by the Pearl and Reed formula. Dr. Baker concluded by stating that the 
future growth of the population of New York City will depend largely upon migration 
—whether from the farms, from the villages, from other cities or from foreign countries 
—and that, therefore, the logical method of forecasting the city’s growth seems to be 
to study with especial care these respective types of migration as well as the rate of 
natural increase. The results thus obtained should then be combined. 

The discussion was continued by Mr. H. B. Stryker, Commercial Survey Engi- 
neer for the New York Telephone Company. He outlined some of the methods used 
by the Bell system in making population estimates. The practice of that organization 
is to analyze past growth into such elements as natural increase and net migration. 
This analysis is followed by a study of the probable causes of the change of each factor. 
Mr. Stryker said: 

In making an estimate we do not assume that the influences which have caused 
the past growth of a particular area will remain unchanged. Instead, we try to pic- 
ture the conditions which will affect, first, the growth of the United States as a whole, 
and secondly its major economic divisions. With this picture in mind we try to 
give each individual state and community its place in the economic life of the nation. 

Mr. Goodrich, in his paper, has approached the population of New York City 
from a United States standpoint, but it seems to me that he has treated both New 
York City and the United States from a viewpoint purely statistical rather than 
from one having also an economic background. 

In our work, and I suppose this is true in other lines of endeavor, it is necessary to 
have more than mere totals. We need to have information as to the character of 
population, distribution of population, the number of families which are represented 
by this population, the type and number of business firms which are in existence 
because of this population. These details can be estimated only after the causes of 
growth are understood for the past and present. 


Dr. Walter Laidlaw, Executive Secretary and Director of Research of the Cities 
Census Committee, was the next speaker. He expressed himself as being somewhat 
skeptical of the possibility of predicting population growth by the projection of mathe- 
matical curves of the type used by Pearl and Reed. In his mind, it is a fact much 
more significant that the percentage of growth of New York City has been steadily 
diminishing decade by decade. He also pointed out that, during the last ten years, 
net migration into New York City from southern and eastern Europe has been large 
enough to account for practically the entire growth of the city, and that this migration 
is now largely cut off by the new immigration law. It seems to him, therefore, that 
the rate of growth in the future is likely to be distinctly less than it has been to date. 

Dr. Raymond Pearl, Director of the Institute for Biological Research at Johns 
Hopkins University, next took the floor. He told how he and Dr. Reed had worked 
out their theory of population growth and described the four stages in the develop- 
ment of this research, or as he put it 

The first working hypothesis was that biologically the growth of a population is 
fundamentally like the growth of an individual animal or plant. He then formu- 
lated this mathematically into what has come to be known as the generalized 
“logistic law” of population growth. The actual statistical test has demonstrated that 
the growth of every human population of large size for which there is sufficient 
census data follows the Logistic Law. This test has been applied to the world as a 
whole and to the principal countries, as well as to such cities as New York, Balti- 
more, London, Paris, etc. In the laboratory tests, the growth of population of 
lower animals, such as flies, has been studied under controlled conditions and found 
to follow the law exactly. 


Discussing the future population of New York City, Dr. Pearl stated that, 


although in its past history this population had followed the Logistic Law, it was 
impossible to assert it would necessarily continue to do so in the future. New social 
or economic forces might come in and alter the direction of the curve. Predictions 
of the future population of large urban aggregates over more than a short period of 
years in the immediate future must always contain a large element of guess-work. 


Besides continuing his work on population, Dr. Pearl is investigating the biology of 
longevity and the biological aspects of alcohol. 

Mr. William M. Steuart, Director of the United States Census Bureau, described 
the interest taken by the American people in the question of the growth of city popu- 
lation and read typical extracts from letters received regarding their estimates for in- 
tercensal years. He stated that it had been impossible to find any uniform method 
of estimating the population of various cities, and that, therefore, the Bureau of the 
Census had been driven to the simple method of projecting a straight line trend of 
population passing through the last two census figures. At present, however, the 
Bureau is planning to begin in 1927 to base its estimates of the population of the 
United States as a whole on the reported figures on births and deaths in the registration 
area and upon the migration records. An effort will be made to apportion the total 
thus obtained between the different states. It does not seem feasible, however, to 
carry the apportionment down to the cities. It is possible that the Bureau will drop 
the city estimates entirely. 
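
The simple method Mr. Steuart described, a straight line passed through the last two census figures, amounts to the following. The Python sketch and its census counts are hypothetical, introduced only to show the arithmetic; they are not the Bureau's figures or procedure in detail.

```python
def straight_line_estimate(year, census_a, census_b):
    """Estimate the population in `year` from a line through two (year, count) points."""
    (year_a, pop_a), (year_b, pop_b) = census_a, census_b
    annual_change = (pop_b - pop_a) / (year_b - year_a)
    return pop_b + annual_change * (year - year_b)

# Hypothetical census counts for a city, for illustration only.
print(straight_line_estimate(1925, (1910, 4_000_000), (1920, 5_000_000)))
```

The method assumes the same absolute yearly gain after the last census as between the last two, which is why it gives no uniform satisfaction for cities whose growth is accelerating or slackening.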

The discussion was concluded by brief remarks by Dr. Thomas Adams, Mr. Robert 
Pope and Mr. Vitautas Graicunas. The meeting then adjourned. 


United States Bureau of Labor Statistics.—Mr. Herman L. Amiss, for several years 
past the able editor of the Monthly Labor Review, died at his home in Washington on 
September 17, 1925, after a protracted illness. 

The Commissioner of Labor Statistics, Ethelbert Stewart, the Assistant Commis- 
sioner, Chas. E. Baldwin, and Lucian W. Chaney of the Bureau attended the meetings 
of the Association of Governmental Labor Officials and the International Association of 
Industrial Accidents Boards and Commissions, both of which organizations met in 
Salt Lake City in August. 

The Bureau is engaged in compiling data concerning wages and hours of labor in 
bituminous and anthracite coal mines and in the pottery, lumber, foundry and ma- 
chine shop, and paper box board industries. A compilation of union wage scales as of 
May, 1925 is also being made. 

Work is in progress on the subject of old age pension plans of employers, dependents 
of workmen injured in industrial accidents, welfare work by employers, poor farm con- 
ditions, and efficiency in cotton manufacturing. A trade union handbook is in course 
of preparation. Current work is being done in the fields of labor legislation and in- 
dustrial accidents. Plans are under way for another wage study in the slaughter- 
ing and meat packing industry in the latter part of the year. 


Pennsylvania State Department of Welfare.—Mr. Emil Frankel, formerly Statis- 
tical Expert of the United States Department of Labor, has been appointed Statisti- 
cian and Director of Research of the Pennsylvania State Department of Welfare. 
Three major studies have been completed. 

(1) Poor relief in Pennsylvania. A state-wide survey abounding with statistical 
material regarding poor relief taxation and expenditures, outdoor relief recipients and 
so forth. 

(2) Detailed financial and population statistics covering all of Pennsylvania’s 
public social welfare agencies. 

(3) Comprehensive survey of Pennsylvania state-aided hospitals, presenting de- 
tailed statistics regarding costs, utilization and free service. 

Mr. Frankel is developing a social index (dependency, delinquency, mental defect, 
etc.) which is intended to serve both as a quantitative measurement of the social prob- 
lems of the whole state, and as an index of seasonal variations in social phenomena. 


Weekly Price Index for Italy—The Chamber of Commerce of Milan has begun the 
publication of a weekly index of wholesale prices for Italy. It is computed by the 
same methods as their monthly index, which dates from May, 1921, namely, the geo- 
metrical average of prices of 125 commodities weighted by a system of multiple quota- 
tions. On the basis 1913 = 100, the index for the first week in September was 685.7 
and for the second 676.5. 
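
A geometrical average of prices on a fixed base may be sketched as below. This Python sketch is one plausible reading only: it treats weighting "by a system of multiple quotations" as entering a more important commodity under several quotations, which is an assumption made here, and the function name and sample prices are hypothetical rather than the Chamber's data.

```python
import math

def geometric_index(current, base):
    """Geometric mean of the price relatives current/base, with the base period = 100."""
    logs = [math.log(c / b) for c, b in zip(current, base)]
    return 100.0 * math.exp(sum(logs) / len(logs))

# Hypothetical quotations; the first commodity is entered twice to give it double weight.
base_prices = [10.0, 10.0, 4.0, 25.0]
week_prices = [70.0, 70.0, 26.0, 160.0]
print(geometric_index(week_prices, base_prices))
```

Because the average is taken over the logarithms of the price relatives, a doubling of one quotation and a halving of another offset each other exactly, which an arithmetical average would not allow.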


Mr. Joseph E. Pogue addressed the National Petroleum Association in Atlantic 
City on September 18th on the topic: “Economic law-breaking in the petroleum re- 
fining industry.” 


Dr. Claude L. Benner has finished his study on the Federal Intermediate Credit 
System for the Institute of Economics and is leaving to take charge of the work in 
economics at Delaware. In addition to teaching in the University he is to have 
charge of the research work in agricultural economics as provided under the Purnell 
bill. 


MEMBERS ELECTED SINCE SEPTEMBER, 1925 

Abel, Theodore F., Department of Sociology, University of Illinois, Urbana, Ill. 

Arndt, Ernst H. D., Professor at Transvaal University College, Pretoria, Union of 
South Africa. 

Bauer, Charles August, Southern Pine Association, 704 Interstate Bank Bldg., New 
Orleans, La. 

Braghin, Alexander Paul, Statistical Dept., Bernard Hewitt Co., 900 West Van Buren 
Street, Chicago, Ill. 

Braun, Elmer W., Graduate Student, Dept. of Agricultural Economics, University 
of Minnesota, St. Paul, Minn. 

Boldrini, Marcello, Professor at Universita Cattolica, Milano, Italy. 

Cochran, Mrs. R., Statistical Bureau, 206 Thomas Bldg., Dallas, Texas. 

Cole, Clarence C., Fellow in Statistical Research, Denver Chamber of Commerce, 
1430 East 18th Avenue, Denver, Colorado. 

Cook, Coleman F., Research Section, U. S. Civil Service Commission, 1724 F Street, 
N. W., Washington, D. C. 

Elsasser, Robert William, Professor in Economics, Dartmouth College, Hanover, 
N. H. 

Finch, Donald T., Student, School of Accounts and Finance, University of Denver, 
Denver, Colorado. 

Fisk, Helen I., New York Charity Organization Society, 105 East 22nd Street, New 
York, N. Y. 

Gardner, Kelsey B., U. S. Bureau of Agricultural Economics, Washington, D. C. 

Hamilton, Allegra E., Bell Telephone Laboratories, 463 West Street, New York, 
N. Y. 

Haskins, Charles Nelson, Professor of Mathematics, Dartmouth College, Hanover, 
N. H. 

Heck, Arch Oliver, Assistant Professor of School Statistics, Page Hall, Ohio State 
University, Columbus, O. 

Herman, Elliott Francis, American Telephone and Telegraph Co., 195 Broadway, 
New York, N. Y. 

Hotelling, Dr. Harold, Food Research Institute, Stanford University, California. 

Jenkins, Dr. Thos. N., National Research Fellow, Columbia University, New York, 
N. Y. 

Kantor, Harry S., 945 Jackson Avenue, New York, N. Y. 

Kennedy, Louise V., Institute of Social and Religious Research, 370 7th Avenue, 
New York, N. Y. 

King, Frances, Cattaraugus County Health Demonstration, 411 Exchange Bank 
Building, Olean, N. Y. 

Knowles, Arthur T., Westinghouse Commercial Investment Co., 150 Broadway, 
New York, N. Y. 

McConnell, Frank A., Teachers Insurance and Annuity Association, 522 Fifth Ave- 
nue, New York, N. Y. 

Olsen, L., International Banking Corp., 60 Wall Street, New York, N. Y.

hem, Penn.

Zugaro, Fulvio, Direttore Generale, Ministero Guerra (Servizi Logistici), Roma, Italy.







REVIEWS 


Common Stocks as a Long Term Investment, by Edgar Lawrence Smith. New 
York: MacMillan Company. 1924. 129 pp. 

Principles of Investment, by John Emmett Kirshman. New York: A. W. Shaw 
Company. 1924. 902 pp. 

Valuation of Industrial Securities, by Ralph Eastman Badger. New York: 
Prentice-Hall, Inc. 1924. 188 pp. 

The study by Edgar Lawrence Smith is an original and important contri- 
bution to the understanding of the problems of investments. Mr. Smith comes 
to the conclusion that stocks have a large advantage over bonds as a medium for 
long term investment, and particularly for the private investor as compared 
with a financial institution. He disposes effectively of the tradition that bonds, 
and especially first mortgage bonds, are the best type of investment. 

These conclusions are based upon a series of statistical tests covering various 
periods from 1866 to 1922. Generally the period selected is approximately 
20 years. In making his eleven tests, Mr. Smith has used a modified form of 
random sampling. Both as regards the period of time selected and the choice 
of securities, precaution has been taken to eliminate bias in favor of stocks. 
In some instances, though deliberate attempt was made to choose such dates as 
would favor bonds, i. e. when commodity prices were falling, the results favored
stocks over bonds. 

Common stocks, according to Mr. Smith, are better than bonds as long term 
investments, because of an essential difference between a share of stock and a 
bond. A bond is an obligation which is satisfied by periodic payments of fixed 
interest, and of a fixed principal at maturity. It is therefore subject to change 
in value from changes in the purchasing power of the dollar. The value of a 
share of stock on the other hand is determined by earning power—in other words, 
it shares in the growth and prosperity of the country; in the industry represented; 
and in appreciation of commodity prices. Bonds tend to rise when prices are 
falling and decline when prices are rising. Stocks, on the contrary, tend to rise 
and fall with the rise and fall in prices. But when prices are declining and 
stocks would be expected to fall, the factors just mentioned often more than 
offset the tendency to decline. 

A diversified list of stocks is more likely to afford the investor a greater margin 
of safety, to make up for loss sustained in making a bad choice, than a similarly
diversified list of bonds. Bonds being obligations to secure the payment of a 
specific sum at maturity cannot appreciate much above par, even if earnings 
are extraordinarily large. Should earnings decline, bonds may depreciate 
considerably and may, in fact, become absolutely worthless. While stocks may 
also decline in value under adverse conditions, should the earnings become very 
large the value of the stock may rise to many times par. Thus, if a wrong choice 
is made of one bond among a group of bonds purchased, the loss may not easily 
be made up from increases in the prices of the other bonds on the list. In the
case of stocks, however, while there may be more risk in an individual issue,
losses sustained may be overcome by gains from stocks wisely chosen. 

Moreover, as Mr. Smith points out, the self interest of the management is in 
the common stock. The bondholders are outsiders. When, therefore, an 
enterprise is successful bonds are redeemed, if possible, before maturity, and 
thus the position of the common stockholder improved. 

Another factor making for a greater return from stocks is the fact that in well 
managed concerns, at the beginning at least, all of the earnings are not paid 
out in dividends. The surplus earnings are put back into the property and the 
value of the stockholders’ equity increases at a compound interest rate. Earnings 
are correspondingly increased. Mr. Smith finds that the rate of increase is 
two and one-half per cent compounded annually and that the market does not 
fully discount this condition. 
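
To make the compounding concrete, the following is a minimal sketch in Python and is not taken from Mr. Smith's book. It assumes a hypothetical starting equity of 100 and a horizon of 20 years (the approximate length of the test periods); the function name `compound_value` is likewise an illustration only.

```python
def compound_value(initial, annual_rate, years):
    """Value of `initial` after compounding once a year at `annual_rate`."""
    return initial * (1.0 + annual_rate) ** years

# Hypothetical figures: equity of 100 growing at 2.5 per cent for 20 years.
value = compound_value(100.0, 0.025, 20)
print(round(value, 1))
```

Under these assumed figures the equity grows to roughly 164, that is, by nearly two-thirds, from reinvested earnings alone.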

The period studied covers an era of great increase in population with even a 
larger expansion in industrial activity in the United States. Are these factors 
a partial explanation of the increase in common stock values and may the same 
rate of increase be expected in the future? Mr. Smith holds that the growth of
industry is likely to continue and thus still favor stocks. 

Mr. Smith’s book is a very significant and valuable study. He has done much 
to direct attention to some essential differences between stocks and bonds. His 
book shows the dangers of judging the value of securities by their names, i. e.
first mortgage, debenture, preferred stock, common stock. He emphasizes the 
essentials of real values as opposed to traditional values. 


Principles of Investment, by Dr. Kirshman, is an encyclopedic book. It consists 
of four parts, the most important of which is Part 2. This section of the book 
deals with the question of investment risk. Dr. Kirshman’s analysis of
investment credit includes chapters on the contract, i. e. the kind of legal obligation,
the asset, the income, and the good will elements of credit. 

He places much stress on assets and holds that there is much overemphasis 
of the importance of earnings. In this view he differs from most recent writers 
on investments. 

Secured loans, to be sure, are safer than unsecured loans to the same individual 
or corporation. But assets are of value only in the event of liquidation and the 
investor should be brought to realize that physical assets of a company in liquid- 
ation cannot easily be diverted to other uses and made saleable for a reasonable 
proportion of the asset value. Assets without earnings usually mean low value 
of security, while a security on the other hand would be quite safe if there be 
continued large earnings and comparatively little assets. 

Dr. Kirshman gives considerable historical and descriptive material of value 
to the student and the general reader in his chapters on the various types of 
securities. Part 4 and Part 5, on the movement of security prices and invest- 
ment policy, constitute a restatement of second-hand information and, in the 
opinion of the reviewer, would be of little practical assistance to an investor. 

The book is unnecessarily long. The volume could easily stand the elimination 
of the author’s views on surtaxes, government ownership of railroads, Bolshevism,
etc., which have no practical application to the problem of determining what
constitutes a safe and profitable investment. 

In an appendix is given the definition of various terms used in the investment 
field. This is well done and is a useful addition to the volume. 


Dr. Badger has worked out in his volume on Valuation of Industrial Securities 
certain empirical ratios between earnings and market value of industrial securi- 
ties. He offers these ratios as a method for determining the value of securities 
which for various reasons do not have a market. 

Dr. Badger argues that in the valuation of industrial securities as distinguished 
from public utility securities, earnings are by far the most important factor in 
the determination of their value. Book value or assets are only of significance 
in the cases of corporations which face possible foreclosure or reorganization 
proceedings because of inadequate earnings. 

H. K. Herwitz


New York City 


The Stabilization of the Price of Wheat During the War and its Effect Upon the 
Returns of the Producer, by Frank M. Surface. Washington: U.S. Grain 
Corporation. 1925. 100 pp. 

This report appears to owe its existence principally to the desire of certain 
members of President Wilson’s Committee to Determine a Fair Price, and of the 
former United States Food Administration, to clear themselves, by appeal to 
the record, from the charge of having dealt unfairly with the wheat farmers. 
Largely for this purpose it gives a concise and well documented history of the 
work of the United States Grain Corporation and marshals effectively a mass of 
facts already known, but it does not disclose many new facts. 

The Government exercised control of the wheat market from August, 1917, to 
the end of May, 1920, and this is the period to which this report relates. Its 
principal conclusion, which is carefully reasoned, is as follows (page 25): “By 
every measure which we can apply the wheat farmer in the United States was 
extremely prosperous during these three crop years. This prosperity was the 
direct result of the policies pursued by the Food Administration and the Grain 
Corporation.” 

The two principal evidences of the prosperity that are adduced in the report are 
that the prevailing prices for wheat induced producers to extend by about one- 
half the acreage devoted to that crop, which in turn brought about a great in- 
crease in the market value of wheat lands. These developments could hardly 
have taken place, as the report remarks, “if wheat growing had not been
exceptionally profitable” to the producer. It is, of course, true to some extent
that wheat growers increased their output, expanded their acreage, and paid 
high prices for wheat land under the influence of non-pecuniary patriotic motives, 
but the report exhibits independent evidence to show that there is no logical 
necessity for emphasizing this influence. In particular, figures of the Department 
of Agriculture show that the farmers realized during the period the largest
margins ever recorded between the cost of production (exclusive of land rent) and
the farm value of wheat. The estimated total profits of wheat growers based on 
these figures and on the size of the crop are also given, and they appear to be 
conclusive. 

The proof advanced for the proposition that this prosperity was the direct 
result of the policies pursued by the Government is not quite so convincing. 
The principal argument, in substance, is that except for the intervention of the 
United States Government, the Allied Governments, through their consolidated 
buying agency, would have paid, and consequently would have established, a 
market price below the basic price ($2.20) that was in fact guaranteed, because 
these governments had in 1917 fixed prices of wheat in their own countries at 
about $1.80 per bushel and their agents were then contending that they should 
pay the American farmer about $1.50 per bushel. They had of course paid as 
much as $3.00 per bushel for some American wheat earlier in 1917, but this was 
before they unified their purchasing. The need for wheat was so urgent, and 
the obstacles to obtaining it from sources more remote than the United States 
were so great, that citation of their intention to buy (if possible) at $1.50 can 
scarcely be regarded as proving that they could have made their views prevail 
if there had been no intervention by the Government. A considerable part of 
any higher price that foreigners might have had to pay, however, in a market 
left to itself under war conditions and fluctuating wildly from time to time, would 
doubtless have gone to speculators and other middlemen instead of to farmers. 
The influence of the Government, as this report shows, was exerted toward in- 
suring reflection back to the farmer of as large a proportion as possible of the price 
paid for wheat at terminal markets and for flour at retail stores. The evidence 
adduced shows that middlemen’s margins did not in fact rise much above pre- 
war levels and that Government regulation was largely responsible for this, 
partly by reducing the hazards incidental to fluctuating prices, but even more by 
stipulating the margins. 

The principal fresh information contained in the report is that drawn from 
the minutes of the meetings held in August, 1917, by the committee that deter- 
mined the basic price of wheat. It appears (page 44) that the estimates of the 
current cost of producing wheat upon which the committee proceeded were much 
higher than those which later investigation found to have prevailed, and also 
that the committee was deadlocked for several days while different members 
contended for prices varying from $1.84 to $2.50. 

Carl E. Parry

Federal Reserve Board, 

Washington, D. C. 


The Regularization of Employment. A Study in the Prevention of Unemployment, 
by Herman Feldman, Ph.D. New York: Harper & Brothers. 1925. 437 pp.

Mr. Feldman was employed for a brief period in 1921 by the Economic Advisory
Commission of the President’s Unemployment Conference, to study
methods adopted in industry to prevent unemployment. Impressed with the
importance of determining the practical possibilities of prevention of
unemployment through a better technique of administration, he decided to continue his
investigation, and the present study, published under the auspices of the American
Management Association, is the result. 

The author is concerned not with the relative amount of unemployment, but 
with the broader and more basic problem of the insecurity of employment, a 
problem which becomes increasingly serious with the interdependence of
industries and the increasingly powerful rôle which style changes play in modern
life. Employment is becoming affected by seasonal changes in more industries 
and to a greater degree than ever before. Recent studies, cited, indicate that 
cyclical depressions have increased in frequency with no diminution in intensity. 
The problem of casual labor, although it has been practically overcome by a few 
progressive firms, shows no indication of becoming, as a whole, less acute. 

Unemployment is not the whole evil resulting from irregularity of employment. 
Irregularity affects the employer and the consumer as well as the worker. From 
the employer’s point of view, discontinuous production results in higher labor 
cost, not only in increased compensation for irregular work, and impairment of 
skill, but also in reluctance of the employee to give his fullest coöperation in
production. It results in increased overhead costs, in greater material cost, due to
money investment lost in material not put into production and to depreciation 
and waste of such materials, and it results in increased cost of labor turnover. 
In large measure, the losses of irregular operation are shifted to the consumer who 
pays a higher price for the product. Irregularity affects not only the income of 
the worker, but also his health, his efficiency and his whole attitude toward in- 
dustry and toward life. It thus becomes a social problem as well as an industrial 
one. The author discusses in some detail irregularity of employment as a cause 
of pauperism—its relation to the health of the worker and to his liability to acci- 
dent. 

By analysis and comparison of the aggregate amount of unemployment due to 
cyclical depressions and to dislocations produced by seasonal and daily fluctua- 
tions throughout the year, the author concludes that chronic unemployment totals 
larger than the unemployment of the industrial crisis. This conclusion, together 
with the fact that experience in dealing with seasonal fluctuations is essential to 
an effective handling of the more difficult problem of reducing cyclical fluctua- 
tions, led the author to believe that more immediate and practical progress lies in 
the regularization of the more chronic types of fluctuations. 

The main responsibility for the present situation and for its alleviation is 
placed upon management. “One of the chief causes of irregularity of
employment in the past has been the indifference of management,” and the main
emphasis throughout the book is upon prevention of unemployment, through the
provision of regular work in industry.

The major portion of the book is devoted to a discussion of the possibilities of 
regularization through improvements in the management of distribution,
production and personnel within the individual plant or industry. Principles of
business management applicable in the regularization of employment are dis- 
cussed at length with illustrations from the experience, successful or otherwise, of 
one or more American firms, with some discussion of theoretical possibilities.
Most of the information was obtained by first hand investigation, by the author,
or by correspondence with the responsible heads of firms. 

Distribution technique which will aid regularity of employment is given first 
place in the discussion because all other methods of steadying work depend chiefly 
upon the control achieved over demand. In “distribution technique” are included
not only sales problems, but also the technique of planning distribution (distribu- 
tion engineering) not properly included in selling. Neglect of such technique has 
been quite general due in part to the belief that distribution cannot be controlled, 
and in part to the fact that distribution activities have not been motivated by a 
desire to provide continuous production. Modern administration, however, has 
opened up a new vision concerning the opportunities at hand for individual firms 
to anticipate and control outside circumstances which are apt to disturb them, 
and numerous instances where regularization of demand has been accomplished 
are cited. Some of the elements of distribution technique which may prove of 
aid in eliminating uneven production in the factory are topically summarized as 
follows: 

(1) Distribution research. 

(2) Diversifying of output. 

(3) Reducing excessive variations of the same product. 

(4) Modifying the extremes of style changes. 

(5) Getting the consumer to buy more regularly. 

(6) Securing orders well in advance of need from dealers. 

(7) Developing long-range policies of business expansion. 

(8) Coördinating sales, production and other factors through scientific sales
planning.

Such distribution technique can be applied to small businesses as well as large 
and to various types of industries. The effectiveness of any particular expedient 
will vary according to special circumstances and according to the judgment used 
in its application. 

Beneficial results of scientific control of demand can be nullified, however, by 
improper production organization. The advantages of scientific production 
planning in steadying employment frequently have been pointed out by other 
writers and the author has limited himself to helpful suggestions on some of the 
more neglected aspects of production engineering which are of particular aid, such 
as overcoming the weather factor, utilizing the dull period, adjusting hours of 
work and manufacturing for stock. 

The third essential to regularization by individual plants is the proper planning 
of personnel requirements involving a varied use of the labor force, with the plant 
employment department as one of the first administrative requisites. In his 
discussion of labor policies, the author has been constructive and considerate of 
the point of view of both employers and unions. He has indicated the points of 
difference and the ways in which both can coöperate. For example, labor’s
objection to the introduction of new machinery, always a troublesome problem,
has been based largely upon a desire to conserve employment. This objection 
can be removed by forethought on the part of management. The adjustments 
necessary in introducing new machinery can be made gradually by ceasing to hire
new workers, by training for the new job those who would otherwise be dismissed
and by similar arrangements, so that it will be unnecessary to lay off workers. 
“Employers who do not show consideration in planning adjustments of labor re- 
quirements when new machinery is introduced cannot with reason protest against 
restriction which trade unions have built up in this matter.” He concludes:
“Devices which make it possible for an extremely irregular industry to get along 
with smaller labor reserves can be considered beneficent, however, only in so far 
as sudden disturbances are avoided and labor is directed gradually into more 
regular industries. On the other hand, only those who have the crude theory 
that everything which makes more work is economically desirable, whether the 
energy is used in wasteful methods of manufacturing or in manufacturing wasteful 
things, can oppose labor-saving devices, humanely introduced.” 

There is a helpful discussion of the question of the desirability of equal distribu- 
tion of work, the attitude of the trade unions toward it, and the conditions upon 
which equal distribution in various industries should depend, such as the greater 
differences in efficiency among workmen; value other than efficiency of certain 
workmen; expense and difficulty incurred in recruiting the working force in a 
busy season; overhead charges, etc. From the author’s point of view, to the ex- 
tent that seasonal, cyclical and other types of unemployment may be avoided by 
intelligent anticipation of these recurrent evils, the use of part time work as a 
means of overcoming the effects is a makeshift. It is better that work be made 
regular for a large proportion and irregular for a few wage earners than that every- 
one should be insecure in his job. 

The lack of coöperation between labor and management to the end that
employment may be secure, the author ascribes in general to the antagonism of
employers and to narrow minded trade union leadership. Specific ways in which
the attitude of both can be more constructive are suggested. It is the author’s 
belief that labor, instead of placing most hope in provision of public work by 
government, national and state, should place more emphasis upon the regularity 
of work within each industry. He suggests that an effective way in which unions 
can aid in regularization is by agreement to differentiation of wage rates in an in- 
dustry in accordance with the regularity of work offered by individual employers; 
or, better, insistence upon unemployment insurance plans which provide financial 
reward to efficient employers. 

Systematic training and transfer of employees within the plant are indispens- 
able to regularization of employment and the author advocates not only specific 
job instruction, but also training for versatility which will make possible transfer 
of workers from one job to another. Experiences in this respect of such firms as 
Goodyear Tire and Rubber Co., The Plimpton Press, and Joseph and Feiss are 
noted. The matter of transfer is more troublesome because of the workers’ lack 
of interest, and probable inefficiency on temporary work and because of the ob- 
jection of the unions, particularly the craft unions. 

The book is made more practical by the discussion of the possibilities of re- 
ducing fluctuations in production and employment in four of the most seasonal 
industries, namely, construction, coal mining, longshore work and farming.
Investigations have shown that irregular employment in these industries is
caused largely by obvious faults of economic organization and managerial
technique.

The matter of the coördination of the labor supply between industries, the
author suggests, should be handled by employment exchanges administered
jointly by employers and unions in those industries in which responsible unions
are recognized (e.g. construction, mining). In case no recognized union exists,
employers can do a great deal to coördinate labor requirements. The function
of the public employment agencies should be, not an exclusive employment system
but rather a final coördination or clearance of labor between trades having their
own systems. They should be primary agencies only for unorganized trades, for
casual work, and for miscellaneous jobs and such special reserves as are required
in farming. They should also become the most efficient information centers with
regard to employment, since neither employers nor union agencies can be
commended in this respect; the former being too much interested in a large labor
reserve, the latter, on the other hand, in keeping workers out of their locality lest
the market become overstocked.

Systematic dovetailing of employment through employment exchanges, usually 
assumed to be an important solution of the unemployment problem, the author 
discusses at some length and concludes to be of limited application. The main 
effort should be to eliminate the need for mobility, rather than to encourage it. 
Substitutes for mobility are suggested. “The final thing to be said concerning
public employment agencies is that, after all, they are superficial agencies. They 
facilitate, they mitigate, but they do not do much to eliminate.” 

Unemployment insurance plans have proven a great aid to regularization. 
If properly drawn, they furnish the necessary financial incentive and tend to 
reduce unemployment to a minimum. That they also have numerous business
advantages is claimed by some firms which have tried them. The unemployment 
insurance principle is already accepted in indirect form in such adjustments as 
payment of a higher wage for temporary work. Insurance schemes are therefore 
an extension of an accepted principle, securing important economic advantages, 
and allowing for more effective administration. Discussion of unemployment 
compensation is made more concrete by analysis and comparison of four types of 
American plans: the Dennison plan, the Procter and Gamble plan, the Cleveland
plan of the International Ladies’ Garment Workers’ Union and the Cleveland 
clothing manufacturers, and the Chicago market plan of the Amalgamated 
Clothing Workers and the Chicago Federation of Clothing Manufacturers. 
Of these the Chicago plan is criticized on the ground that it lacks a strong incen- 
tive to regularize employment; the Dennison plan does not protect the worker 
against arbitrary dismissal. 

Mr. Feldman proposes an original contributory plan of unemployment in- 
surance combining in a practical way the two basic principles of an insurance 
scheme—prevention and compensation. Under his plan the contributions of 
employers and workers would be kept separate, the employer’s part of the joint 
fund to be used to compensate for all short periods of unemployment—the first 
4 or 8 weeks of unemployment in a year, the workers’ part to be used for all 
unemployment beyond that, as in periods of great business depression. “Thus
the employer would have every incentive to regularize employment, especially
with regard to the ordinary seasonal and short-time fluctuations, while for periods 
of great depression or disaster which could not be averted by his efforts, there 
would be reserve funds in the form of employees’ savings.” 

Some of his readers may not quite share the author’s enthusiasm for the success 
of the unemployment insurance plans now being tried out in this country. Un- 
doubtedly each of them has some of the beneficial features pointed out and the 
principle has been generally successful. As relief measures, however, the amount 
paid in benefits is, in many cases, wholly inadequate to meet the workers’ need, 
and it is a question whether or not the majority of firms or of markets could sup- 
port a plan which would pay adequate benefits, particularly in a depressed period 
when competition is keen, unless the entire industry should support the plan or 
unless it is subsidized from some outside source. In some instances the funds 
have been inadequate during periods of depression to meet even the meager bene- 
fit provided in the plan. However, the fact that in such instances the plans have 
been reorganized or benefits stopped until the funds could recuperate, and in no 
case, to our knowledge, have been dropped altogether, speaks well for the success 
of the principle. 

The present need, the author points out in conclusion, is not the development 
of new or different ideas for combatting unemployment, but to put into practice 
the ideas already accepted. In this, the responsibility falls upon everyone. 

Some of the remedies suggested seem at first a little utopian, but the author 
does not suggest that it all can be accomplished in a day, nor does he anticipate a 
complete solution. “What is hoped for is that enough progress will be made in
time to leave a residue that would concentrate the problem on the genuinely re- 
quired reserve of unemployed and the genuinely unemployable.” 

Although written from the point of view of the student, the business executive 
will find in this book many constructive suggestions sufficiently detailed to be of 
practical value. The student will find it readable and time saving in that it 
brings together from widely scattered sources much valuable material on the sub- 
ject, with adequate footnotes for reference to the original matter. It is written 
with a simplicity and clarity which will make it interesting to the non-technical 
reader. 

MARGARET GADSBY 

Forest Hills, L. I. 


A Definition of Income and Its Application in Federal Taxation, by William 
Wallace Hewett, Ph.D. Philadelphia: Westbrook Publishing Co. 1925.
91 pp. 

This book sets forth in a commendably clear manner a number of the leading 
problems involved in the definition of income, and also outlines briefly but in- 
telligibly the development of the legal definition of income, as exemplified by the 
decisions of the Federal courts. 

The author speaks favorably of the criteria of a good definition set forth by 
Professor Irving Fisher: namely, “It must be useful for scientific analysis; and it
must harmonize with popular and instinctive usage”. He also endorses Sir
Josiah Stamp’s definition of social income and models thereafter his own, which is
as follows: “Net social income is the total flow of commodities and services,
through a given period of time, available for distribution after maintaining the 
industrial equipment intact”. This definition appears to conform reasonably 
well to both of Professor Fisher’s principles, but, when the same definition is 
modified slightly to cover individual income, it can scarcely be said to harmonize 
with either popular or instinctive usage. Net individual income is defined as: 
“The flow of commodities and services accruing to an individual through a period 
of time and available for disposition after deducting the necessary cost of ac- 
quisition.” This definition is at variance with the popular impression that in- 
dividual income is always measured in terms of money, and it fails to accord with 
either of the two concepts which, in the opinion of the reviewer, are logical and 
scientific, namely: 

1. Psychic income 

2. Services received 

Furthermore, Professor Hewett’s definition will lead him at once into a morass 
if he attempts to utilize it in actual computation, for, to obtain net income, he is 
compelled to deduct costs, and costs cannot be successfully dealt with except in 
terms of money. 

A much simpler way, in practice, of eliminating the effects of changes in the 
purchasing power of money is first to calculate the individual income in terms of 
money and then, through division by an index number, to convert it into purchas- 
ing power of the type desired. As a matter of fact, it appears that the author 
fails to adhere consistently to his definition, for, in his discussion of corporate in- 
come, he implies throughout the calculation of money income along the lines just 
described. 
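
The procedure the reviewer recommends can be sketched in a few lines of Python: money income is computed first and then divided by a price index to express it in purchasing power. The function name `real_income`, the income figures, and the index values below are hypothetical, chosen only to show the arithmetic; none of them comes from the book under review.

```python
def real_income(money_income, price_index, base=100.0):
    """Express money income in purchasing power at base-period prices."""
    if price_index <= 0:
        raise ValueError("price index must be positive")
    return money_income * base / price_index


# Hypothetical figures (assumptions for illustration, not data from the source).
incomes = {1913: 1500.0, 1924: 2400.0}   # money income in dollars
index = {1913: 100.0, 1924: 160.0}       # price index, 1913 = 100

deflated = {year: real_income(incomes[year], index[year]) for year in incomes}
print(deflated)
```

With these assumed figures a 60 per cent rise in money income is exactly offset by a 60 per cent rise in prices, so purchasing power is unchanged.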

A minor criticism of Professor Hewett’s discussion of individual income re- 
ceived from corporations is that he fails to take cognizance of the fact that, in 
many instances, even though large amounts of earnings are reinvested, the total 
value of the stock of the corporation grows less. Such has been the case with 
the railroads during the last quarter century. Under such circumstances, it is 
evident that the earnings reinvested are not part of the income of the stock- 
holders. 

In discussing the legal definition of income, Professor Hewett gives a summary 
of the various Federal Income Tax laws and the interpretations that have been
placed upon the definitions contained therein. His view that the law of 1861 was 
intended to tax gross receipts appears to be an assumption that needs substantia- 
tion. Is it not more probable that the failure of the law to specify that the tax 
was to be levied upon net income was due merely to hasty drafting of the statute, 
rather than to any intent to use gross income as a basis for taxation? 

It is interesting to observe that the mode of calculating corporate income as 
provided by the legislation of the earlier years was much more defensible from the 
economic standpoint than that decreed in laws more recently enacted, for, in the 
earlier laws, a tax was levied upon the net earnings of the corporation, rather than 
upon the amount of dividends paid. It appears obvious that it is only new earnings,
and not dividends of any kind, whether paid in cash or stock, that strengthen
the economic position of the stockholder. However, as the author points out,
expediency is a matter of great consequence in taxation, and it may well be easier 
to collect taxes on dividends, than on corporate earnings. The whole concept of 
“realized income” is delightfully vague and lacking in scientific precision, even 
though it has been worked out in such a manner as to furnish a definite basis for 
tax collection. 

Perhaps, however, Dr. Hewett is putting it too strongly when he assumes that 
it is not feasible to tax accrued income. Is it not possible to compel each indi- 
vidual to file an inventory of his business assets at the beginning and at the end of 
each year, and to report on the amount of money withdrawn from his business 
during the year? As a matter of fact, would it not be better to levy a tax on consumption,
rather than on income, despite Professor Daniels’ belief that such a tax
gives rise to “a prima facie case of injustice sufficient to bar a restrictive concept
of income from being adopted in the financial glossary”?

Dr. Hewett states that Congress has exempted certain parts of income “be- 
cause of social justice”. His choice of this phrase to cover the reasons actuating 
the National legislature is a dubious one, for the term “justice” is hard to define. 
It would be more accurate to say that exemptions are made with a view to fur- 
thering some social policy that is deemed by those enacting the law to be desira- 
ble. The opponents of the plan will probably believe the policy adopted to be 
distinctly unjust. Likewise, the assertion that, according to the legal concept, 
gains that do not arise out of the “productive” process are not income, can be 
defended only if one interprets the term productive as being synonymous with
acquisitive, a view that many economists will not endorse. 

Even though the reviewer differs with the author of the book in respect to some 
of the points just mentioned, this is far from saying that the book is not a worth- 
while contribution to the literature of the subject covered. It is well written and 
serves admirably to set forth the chief issues involved and to outline the progress 
made by the courts in defining income. 

WILLFORD I. KING

National Bureau of Economic Research, Inc., 

New York City 


Non-Voting, Causes and Methods of Control, by Charles Edward Merriam and
Harold Foote Gosnell. Chicago: The University of Chicago Press. 1924. 
xvi, 287 pp. 

On April 3, 1923, an important municipal election was held in Chicago. Of 
the more than 1,460,000 eligible electors only 900,000 were registered, and of 
these only 723,000 cast votes. Approximately 740,000, or more than half of the 
eligible adult citizens, failed to exercise the elective franchise. The study under 
review was undertaken in order to determine the causes of this striking political 
phenomenon. Particular interest attaches to this investigation because it rep- 
resents an attempt to apply quantitative methods in the solution of a practical 
problem in political science. 


The general inquiry covered several fields. Questionnaires were sent by mail
to precinct committeemen, ward committeemen and other persons prominent in
the city, for the purpose of securing supposedly expert opinion as to the causes of 
non-voting. About 300 returns were received from this group. Other question- 
naires were circulated and filled in by agents (largely graduate students) among 
some 6,000 non-voters. As a control group, to be used in judging the results 
secured from the non-voters, a sample of about 5,000 voters was taken, and for 
this group data as to age, sex, nativity, term of residence and type of citizenship 
were secured from the records of the Election Commissioners. In addition, some 
detailed studies were made of particular precincts, and census data for ten census 
enumeration districts were employed in determining the relation between social 
and racial factors and political activity. 

The information received from the non-voters who were interviewed in person 
constitutes the bulk of the material on which the conclusions of the investigators 
are based. Chief interest, accordingly, attaches to the make-up of this sample of 
non-voters, and to the use made of the information secured from them. 

The sample finally employed numbered 5,310 persons, about three-quarters of 
one per cent of the total number of non-voters in the city. This was a small 
sample, considering the heterogeneous character of the population being studied. 
It was particularly important, therefore, that it be thoroughly representative, a 
fact recognized by the investigators. The racial distribution within the sample 
accords fairly closely with that found among the adult citizens of Chicago in 
1920, though the proportion of naturalized Italians, Russians, Poles and Czecho- 
Slovakians in the sample seems unduly large. The sex distribution in the sample 
is a bit over-balanced in favor of females. This group constituted 68 per cent of 
the sample, as compared with an estimated figure of 63 per cent of the total number
of non-voters in Chicago. The proportion of non-registered non-voters in the
sample (64 per cent) is somewhat less than the proportion of non-registered non- 
voters in the city as a whole (75 per cent). What appears to be the most serious 
departure from a well-proportioned sample is found in the distribution of occupa- 
tions. Of the 5,310 persons in the final sample 3,035 were house-wives. These 
constitute a trifle over 57 per cent of the total group, and about 84 per cent of the 
women in the sample. The preponderant effect of this group of house-wives 
upon the results is made clear when the reasons for non-voting given by all mem- 
bers of the sample other than house-wives are separately tabulated and compared 
with the results for the entire group. The authors suggest that the great number 
of house-wives in the sample of non-voters may be due to the way in which the 
interviewers did their work. Presumably the house-wives were most readily accessible
to door-to-door canvassers. Certainly the weight given to this single
group requires justification before the sample may be accepted as representative 
in the matter of occupational distribution. 
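
The proportions quoted in this and the preceding paragraphs can be checked with a short Python calculation. All figures are those given in the review; the variable names are illustrative only, and the count of women in the sample is derived from the reported 68 per cent rather than stated directly.

```python
eligible = 1_460_000        # eligible electors ("more than 1,460,000")
votes_cast = 723_000        # votes cast on April 3, 1923
non_voters = eligible - votes_cast          # the review rounds this to 740,000

sample = 5_310              # non-voters in the final sample
housewives = 3_035          # house-wives in the sample
female_share = 0.68         # reported share of women in the sample

sample_fraction = sample / non_voters               # share of all non-voters sampled
housewife_share = housewives / sample               # house-wives as share of sample
women_in_sample = female_share * sample             # derived, not reported
housewife_share_of_women = housewives / women_in_sample

print(f"non-voters:                    {non_voters:,}")
print(f"sample as share of non-voters: {sample_fraction:.2%}")
print(f"house-wives in sample:         {housewife_share:.1%}")
print(f"house-wives among women:       {housewife_share_of_women:.0%}")
```

The results agree with the text: roughly three-quarters of one per cent of non-voters were sampled, house-wives made up a little over 57 per cent of the sample, and about 84 per cent of the women in it.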

The reasons for not voting given by the persons interviewed are listed under 
some twenty heads. General indifference is found to be the chief reason for non- 
voting, with illness, absence from the city and neglect standing next in order. 
(When house-wives are excluded, the four most important reasons are absence 
from the city, general indifference, fear of loss of business or wages, and neglect.) 
The possibility of evasion or rationalization in the answers made by persons interviewed
is recognized, but the investigators believe that on the whole trustworthy
answers were received. 

The discussion of causes of non-voting is supplemented by a chapter on 
methods of stimulating and depressing voting tendencies which will be of interest 
to students of political science. 

As an effort “to present a mass of concrete material . . . on the basis of which
students of politics may get behind some of the current notions as to the way in 
which democratic government is working” this study is to be welcomed. That it 
falls somewhat short of perfection as an example of clear-cut quantitative analysis 
is perhaps attributable largely to the nature of the material dealt with. Yet 
the reviewer feels that something more could have been done in giving precision 
to such statements as “there was a distinct relation between the reasons given
and such factors as sex, registration status, nationality, age, term of residence in
the city and economic status” (p. 51). What constitutes a distinct relation?
And, in comparing the frequencies recorded for different occupational or racial 
groups of non-voters, or the age or race frequencies found in the samples of voters 
and non-voters, what constitutes a significant difference? In a statistical study 
one expects more attention to such questions than is found here. For such a 
study, too, one feels that there is present some degree of redundancy in the textual 
matter explaining and elaborating upon the tables of results. But, in spite of 
these defects, this is a significant study. Working with material difficult to fit 
into the molds commonly employed in statistical analysis, the authors have made 
a promising beginning in the employment of quantitative tools in this new field 
of political study. That more will be done cannot be doubted, and it is certain 
that later investigators will profit by the results of this pioneer study. 
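
One way to give the question "what constitutes a significant difference?" a concrete form is to compare a difference with its standard error. The Python sketch below applies this to the one pair of figures the review supplies: 68 per cent women in the sample of 5,310 against an estimated 63 per cent among all non-voters. It assumes simple random sampling, which the door-to-door canvass was not, and it illustrates a present-day convention rather than any method used by the authors or the reviewer.

```python
import math

n = 5_310          # size of the final sample of non-voters
p_sample = 0.68    # share of women in the sample
p_city = 0.63      # estimated share of women among all non-voters

# Standard error of a proportion under simple random sampling (an assumption).
se = math.sqrt(p_city * (1 - p_city) / n)

# Difference expressed in standard-error units.
z = (p_sample - p_city) / se

print(f"standard error = {se:.4f}, z = {z:.1f}")
```

Under these assumptions the five-point gap is several standard errors wide, so it could hardly arise from sampling fluctuation alone. This bears out the reviewer's concern that the sample over-represents women.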

FREDERICK C. MILLS 


The Negro Year Book, 1925-1926. Edited by Monroe N. Work. Tuskegee, 
Alabama: Negro Year Book Publishing Company. 1925. 544 pp. $1.00, 
paper. 

All students of present-day social problems will welcome the latest (seventh) 
edition of the Negro Year Book, which covers the years 1922-1924. It represents 
a full presentation of Negro life in America, and while it may be charged in places 
with special pleading, this is not true where actual statistics are concerned, and is, 
where it occurs, so obvious that it cannot be said to detract from the value of the 
work. 

Perhaps the outstanding facts that come out of going through it are the amaz- 
ingly complete acculturation of the Negro to the American culture, and the re- 
markable development he has made since the time of emancipation from slavery. 
The very first table in the book, showing how Negroes own some 688,000 homes 
today as against 12,000 owned in 1866, how they conduct some 70,000 businesses 
when they conducted but 2,100 then, how the percentage of literacy has gone up 
from 10 then to 90 now, and similar increases in other phases of their economic, 
educational, and religious position, is merely a forerunner of other tables which 
show similar gains. Going through the work, we find accounts, to select at random,
of the transactions of Negro fire and life insurance companies, of inventions
of various sorts by Negroes, of large new churches completed in these two years, 
and increasingly large amounts of property owned by Negro religious organiza- 
tions, of the growing number of Negro schools, both secondary and higher, with 
increasing Negro support for their own education, of the large number of Negroes 
winning prizes in public schools and the Negroes elected to Phi Beta Kappa in 
1922-24, of the attainments in the arts and sciences of Negroes, and books of 
various sorts published by them, of the increasing numbers of independent 
farmers and business men to be found among them today, of the decreasing 
death-rate and the increasing number of Negro physicians and doctors—a re- 
markable record. 

Besides its purely statistical side, there is much of value to the student who is 
interested in the Negro from other angles. There is an excellent discussion of the 
status of the Negro in the various parts of Africa, and there is also a special bibli- 
ography on the African field which will be of value to anyone wishing to go further 
into either the history, the ethnology, or the present condition of the Negro there. 
Space is also given to the West Indian Islanders. The historical section is well 
put together. There are discussions of the condition of the Negro in slavery 
days—of the status of the free Negro, whose numbers and importance are gen- 
erally overlooked in discussions of the history of the period. An interesting dis- 
cussion is also given of the activities of the Abolitionists and of those Negroes who 
were active in the operation of the underground slave route. The recent migrations
also come in for consideration, and, in view of the attention which this phenomenon
has attracted, it is of great interest to learn that, in 1920, the percentage
of Negroes living in states other than those in which they were born was only
19.9, as against 16.6 in 1910 and 15.6 in 1900. 

The analysis of population statistics is as thorough as any in the book, and 
affords a wealth of concise material. After a presentation of the population of 
the Negro in the census years, there is given, in handy form, the census figures for 
the black and mulatto population for the years in which this has been noted, with 
the increase of each, the age distribution of Negroes, 1920, the classification of 
Negroes by sex and color since 1860, their marital condition, total and Negro 
population by states and the respective percentages of Negro population to the 
total for states, 1920, etc. It is somewhat surprising to learn that the proportion
of mulatto to black decreased in the decade 1910-1920. In the former year it 
was 20.9 per cent mulatto, in the latter 15.9. The Census report is quoted to 
attribute this to the employment of more Negro enumerators, who were more 
careful than white enumerators to list according to color. But it is difficult for 
the writer, who has collected genealogical statements from more than 500 adult 
Negroes, of whom only 20 per cent claim to be full-blooded Negroes with no ad- 
mixture of white or Indian blood, to place reliance on the census statements 
quoted here, particularly since the factor of overlapping in skin color is so great 
that the determination of ancestry by observation of pigmentation is well-nigh 
impossible. 

There is an excellent analysis of the lynching situation, and the data, which are 
original with the editor of the Year Book, are conducive to the hope that this evil
may be on the wane. In the year 1924, there were reported no lynchings of
whites, and 16 of Negroes, as against 4 of whites, and 29 of Negroes in 1923, 6 of 
whites and 51 of Negroes in 1922, and so on in increasing numbers back as far as 
1892, when there were 100 whites and 155 Negroes killed in this fashion (the 
largest number lynched in any one year recorded), and before which the number 
varied between 100 and 200 per year. But the appalling total of 4203 persons 
who have been reported as murdered in this way (1038 white, 3165 Negro) gives 
ample food for thought as to the need for an attempt to curb this particular mos 
in American life. 

The bibliographic sections, scattered throughout the book, are worthy of note, 
and should also be useful. Aside from the one already mentioned on the various 
phases of Africa, there are two long lists of books produced by Negro authors and 
by white authors about Negroes in the years 1922-24, and at the end of the book
there are 37 pages of small type entitled “A Bibliography of the Negro in the 
United States.” The headings indicate its completeness—general bibliographies, 
particular bibliographies, the Negro and slavery, Civil War and emancipation, 
reconstruction, present conditions, special studies and publications, folk songs 
and folklore, literature and art, some books by Negro writers, and articles in cur- 
rent periodicals on the Negro, 1914-1924, in which is included a long list of pam- 
phlets on lynching. 

As has been said, for students interested in any aspect of the Negro problem, 
this work is invaluable. It can also be well utilized as a reference for classes in 
social problems in which that of the Negro is touched upon, and in classes in 
economics, sociology and political science, where attention must be given to the 
factor of the Negro in American life. The work, being a product of Tuskegee 
Institute, is marked by the melioristic tone of that institution, but, as has been 
said, whatever bias has been put into the book can be recognized at once, and 
does not therefore materially detract from its value. 

MELVILLE J. HERSKOVITS


Columbia University 


The Tenth Annual Report of the Federal Reserve Bank of New York. New York. 
1925. 29 pp. 

The Federal Reserve Agent of the New York district is not given to long dis- 
quisitions on the economic and financial aspects of reserve banking. Within a 
scope of twenty-four pages, a liberal amount of which is given over to graphs and
tables, he summarizes the credit conditions of the year, traces the movements of 
the New York money market, and their relation to the bank, summarizes its 
credit policy and presents sufficient operating statistics to furnish a satisfactory 
record of its activities. A decision to avoid duplication of the statistical material 
appearing in the annual report of the Federal Reserve Board has confined the 
statistics of operation to a comparative statement of condition at the beginning 
and end of the year, a summary of income and disbursements and a table of aver- 
age daily transactions. 

The Federal Reserve Bank of New York is not a regional institution. Its location
gives to its affairs a nation-wide significance and its reports should make a
strong appeal to all students of finance, particularly when they are presented in 
the clear, business-like style of the one under review. The brief, concise discussion
of each topic is accompanied by a graphical presentation of the same material. 

A study of the report makes it clear that, in spite of record-breaking domestic
financing operations and sales of foreign securities in this market, together with
the heavy credit demands of a very active stock market, the money market made 
little use of the Federal Reserve Bank until the latter part of the year. In June 
the city members had practically wiped out their indebtedness to the bank but by 
September the inflow of country funds and foreign gold had ceased and resort was 
again had to rediscounting and offerings of bankers’ acceptances. The year
ended with a modest net gain in the Reserve Bank’s earning assets, but it was 
compelled to draw upon its surplus in order to pay the 6 per cent dividend to 
member banks. 

In no year have the four types of movements of funds stood out so clearly, viz.:
(1) the weekly adjustments of members’ reserve accounts, (2) the monthly money 
movement, (3) the tax period or quarterly movement, (4) the seasonal movement. 
The report describes each of these movements and shows its relationship to the 
Bank’s operations in 1924. 

One lays down the report with the feeling that we have a big problem ahead 
of us in the shape of the loan and investment activities of the reserve banks during 
periods of easy money. The central banks of Europe provide for their continuous 
existence by competing with the joint stock and private bankers for the patronage 
of the public. During most of 1924, the New York Federal Reserve Bank had so 
small a volume of rediscounts that it was compelled to place considerable sums on 
the open market and, even then, was forced to draw upon the surplus in order to 
pay its dividend. Member banks are resenting deeply this competition from the
Reserve System, but when they are not contributing to its support by borrowing
from it, they will have to countenance its competition in the money market, if it is 
to function along existing lines. 
GEORGE W. DOWRIE

University of Minnesota 


Social Insurance. What It Is and What It Might Be, by Alban Gordon, B. Sc., 
F.C.S. London: George Allen and Unwin. 1924. 150 pp.

Social Insurance Unified, by Joseph L. Cohen. London: King and Son. 1924. 
157 pp. 

The cumbersome machinery by which England collects contributions on a 
stamp and card basis for nearly sixteen million persons protected under the 
health insurance law and at the same time keeps ledger accounts for every one of 
them, and then duplicates all this again for perhaps three-quarters of them on an 
entirely separate card-stamp-ledger basis under her unemployment insurance law, 
was bound sooner or later to lead to plans for unification of administration. To 
Americans who have studied in England the operation of the social insurance
system and have visualized the complex millions of card and ledger records, the
wonder is that demands for simplification were not insistent long ago. 

It is significant that protests against the duplication of machinery are accompanied
by proposals from all three political parties to increase the benefits and extend
the system to more people. This applies in general not only to health and
unemployment, but to old age pensions, accident compensation and the inauguration
of mothers’ pensions and burial insurance. The success of social insurance,
despite many imperfections, is freely admitted in England. Gordon, who is
secretary of one of the numerous British insurance societies, places the public wel- 
fare above the selfish desires of any private organization and quotes with approval 
official reports denouncing the wastefulness of private insurance companies. He 
gives a clear constructive treatment of the situation in England, deploring the 
absence of preventive work which he thinks should be linked up with the social 
insurance system, and he advocates really adequate benefits paid through a com- 
prehensive, territorial, unified organization to supplant entirely the degrading 
Poor Law. Incidentally he explains the difference between insurance and a 
“dole”, the latter expression being “an unworthy taunt born of ignorance and
snobbery.” He believes that it was due solely to unemployment insurance that 
revolution was averted in England during 1920-23, and reports the results of a 
London survey which led to the conclusion: “Maintenance without employment
may be demoralizing, but unemployment without maintenance is much more certain
in its demoralizing effect.” Gordon’s book is well written and because of its
specific criticisms should commend itself to American readers. 


Social Insurance Unified, as the title suggests, covers much the same ground as 
Gordon’s volume on Social Insurance. The two books appeared almost simul- 
taneously and together mark the recent tendency in England to overhaul, extend, 
improve and unify the social insurance system that has developed piecemeal during
the past fifteen years. Cohen agrees as to the wastefulness of private insurance,
and from the findings of recent official investigating commissions he concludes:
“The case against private insurance seems conclusive.” He agrees with
Gordon that “insurance by industry” is not likely to be successful, and he pro- 
poses a unified administration which, however, would utilize as its central plan 
the existing public employment service. Cohen presents a wealth of information 
concisely stated in a very convenient volume. 

JOHN B. ANDREWS

American Association for Labor Legislation, 

New York City


Principles of Personal Selling

By Harry R. Tosdal, Professor of Marketing, Graduate School of Business
Administration, Harvard University.

This new book really covers the whole field of personal salesmanship and brings
out an unusual conception of the fundamental, underlying [remainder illegible]

An Introduction to the Methods of Economic Statistics

By William L. Crum, Ph.D., Harvard University, and A. C. Patton, M.A.,
Phoenix Mutual Life Insurance Company.

Presents an essential statistical foundation, including definition of terms,
collection and editing of data, construction of tables and charting
[remainder illegible]. 528 pages. Cloth. Net, $4.00

Labor Relations in Industry

By Dwight L. Hoopingarner, Director of the New York Building Congress;
executive of the American Construction Council.

For the first time, here is a book that sets forth a complete presentation of the
entire field of labor relations and their administration, reduced to its essence.
It gives a fundamental point of view, presents the problems, explains the
governing [illegible] plans in operation, interprets and shows trends.
553 pages. Cloth. Net, $4.00

Harvard Business Reports

Compiled by and published for the Graduate School of Business Administration,
Harvard University.

In this first volume of the series are published 149 cases selected from over
3,500 cases collected by the Harvard Graduate School of Business Administration
through its Bureau of Business Research. Each reported case has particular
importance as an example of current business practice or as a guide to sound
business management. Each case presents an actual situation which arose in an
actual business, and the actual method by which the situation was met.
149 significant cases; 561 pages, size 6 by 9 inches; bound in library buckram.
Net, $7.50

Retail Method of Inventory

By Malcolm P. McNair, A.M., Assistant Professor of Marketing; Assistant
Director of Bureau of Business Research, Graduate School of Business
Administration, Harvard University.

This volume explains the retail method of inventory, which gives you a constant
check on stock shortages or overages; enables you to plan sales, stocks, and
purchases more intelligently; affords an indisputable record for insurance
settlement in case of fire; allows periodical determination of profit, at your
convenience. 143 pages. Cloth. Net, $2.00

Published by
A. W. SHAW COMPANY
CASS, HURON AND ERIE STREETS
NEW YORK CHICAGO LONDON

Publishers of: System, Factory, British System, Harvard Business Review, The
Journal of Land & Public Utility Economics, Books for Business Executives,
Business Textbooks for Schools, Harvard Business Problem Books, Harvard
Business Reports


The High-Speed Machine
for all

Adding
Multiplying
Dividing
Subtracting

WITH a figuring machine capable of high speed on both adding and calculating,
any kind of figure results (scientific, statistical or commercial) can be
obtained quickly and at low cost.

The Comptometer is not only a high-speed machine for both adding and
calculating, but it has Automatic Safeguards against operating errors which
make its speed usable with safety.

Operation of the Comptometer is extremely simple: no operating lever to pull,
no crank to turn, no preliminary setting of the keys. Nothing to do but drive
the keys and read the answer; the machine does the rest.

“Better Methods of Handling Figure Work” sent free on request. Write for it.

Felt & Tarrant Mfg. Co., 1715 No. Paulina Ave., Chicago, Ill.

CONTROLLED-KEY ADDING AND CALCULATING MACHINE


MERWIN-DAVIS

CHART AND STATISTICAL SERVICE

CHART SERVICE

Charts for lectures, exhibits, directors’ meetings, class room work,
conventions, legislative hearings, etc.

Chart illustrations for books, bulletins, pamphlets, reports, magazine
articles, etc.

Poster charts for advertising and other publicity.

Graphic analysis of business records.

Lantern slides.

Exact preliminary sketches submitted for all chart work.

STATISTICAL SERVICE

Preparation of statistical data for graphic presentation.
Collection of statistical data from published sources.
Analysis of statistical results of special investigations.
Analysis of business records.

All work under direction of Margaret S. Merwin and Helen Davis, for four
years in charge of chart department and miscellaneous statistical work of
the National Industrial Conference Board, New York.

1200 MADISON AVENUE, NEW YORK
TELEPHONE LENOX 8230


The Statistical Tabulation Bureau
of Washington, D. C.

Offers a new service for statisticians

WE are prepared to assist in the designing of survey schedules,
provide the necessary “punch cards,” edit and code the
schedules, and deliver completed tables ready for publication. We
will also compute either partial or multiple correlation, moving
averages, or any other statistical computation desired.
This service places the most approved methods and the use of
electric tabulating and calculating machines within reach of
persons having only occasional use for them. Estimates will be
furnished on request.

STATISTICAL TABULATION BUREAU
1301-H St., N. W., Washington, D. C.


Chaddock’s
PRINCIPLES AND METHODS
OF STATISTICS

488 pages. $3.75

F. H. HANKINS, Smith College
“By all odds the best book in print for all who wish
a complete and effective treatment of the whole field.”

JEROME DAVIS, Yale University
“I consider the book as undoubtedly the best text now
available for an introductory course in college. It is clear,
thorough, and especially valuable to the student of social
problems. Would that all students in sociology were
thoroughly grounded in it.”

HOUGHTON MIFFLIN COMPANY

Boston New York Chicago San Francisco
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National Industrial Conference Board 


New Publications 


HE most important factor in the development of big in- 
dustries is the attitude of government and public toward 
those industries and their methods of business performance. 

A knowledge of this attitude, and of the changes which, 
during the past generation, have taken place in public olicy 
toward business in general, and business cooperative ad pons 
petitive methods in particular, is essential therefore to every 
business man. 

The most comprehensive studies so far made of the in- 
fluence of anti-trust legislation on organized business in the 
United States are those just published by the National 
Industrial Conference Board. These studies are entitled: 


Trade Associations: Their Economic Significance and 
Legal Status Price $3.00 


and 


Public Regulation of Competitive Practices Price $3.00 


These studies were made under the direction of an able 
Advisory Committee of nationally known business men, economists, 
and lawyers who know the problems involved; they present 
in an authoritative and scientific manner the economic 
significance of present public policy in the regulation of business. 

Among the subjects discussed are: modern changes in 
methods of marketing and commercial organization; the 
regulation of price policies (less-than-cost selling, resale price 
maintenance, misbranding, trade mark simulation); the 
regulation of trade relation policies, etc. 

At the recent National Conference on Trade Associations 
and Business Combinations held under the auspices of the 
Academy of Political Science in New York, these volumes 
received most favorable comment. 


Copies of both books may be ordered directly from 


National Industrial Conference Board, Inc. 


247 Park Avenue, New York City 
A Prize of 
Five Thousand Dollars 


is offered for the best 
adverse criticism of 


PROFITS 


By WILLIAM TRUFANT FOSTER and WADDILL CATCHINGS 


(Authors of “Money”) 





“Profits” presents a far-reaching criticism of the existing economic order, and 
arrives at rather startling conclusions. As the authors wish to build on whatever 
is sound in this book, they are eager to find out, as soon as possible, the worst that 
can be said against their theories. That is why the prize is offered. The argu- 
ment concerning which adverse criticism is particularly invited is this: 

“Progress toward greater total production is retarded because consumer buying 
does not keep pace with production. Consumer buying lags behind for two 
reasons: first, because industry does not disburse to consumers enough money to 
buy the goods produced; second, because consumers, under the necessity of sav- 
ing, cannot spend even as much money as they receive. There is not an even 
flow of money from producer to consumer, and from consumer back to producer. 
The expansion of the volume of money does not fully make up the deficit, for 
money is expanded mainly to facilitate the production of goods, and the goods 
must be sold to consumers for more money than the expansion has provided. 
Furthermore, the savings of corporations and individuals are not used to pur- 
chase the goods already in the markets, but to bring about the production of 
more goods. Under the established system, therefore, we make progress only 
while we are filling the shelves with goods which must either remain on the shelves 
as stock in trade or be sold at a loss, and while we are building more industrial 
equipment than we can use. Inadequacy of consumer income is, therefore, 
the main reason why we do not long continue to produce the wealth which 
natural resources, capital facilities, improvements in the arts, and the self- 
interest of employers and employees would otherwise enable us to produce.” 





The contest is open to everyone, everywhere. Essays must be in English, 
and must be sent to the Pollak Foundation before January 1, 1927. 

The Judges are: OWEN D. YOUNG, Chairman of the Board of Directors of the 
General Electric Company; WESLEY C. MITCHELL, of Columbia University; 
ALLYN A. YOUNG, of Harvard University. 


No one need buy the book, to enter the contest. The book 
may be ordered, however, ($4.00 postpaid) directly of the 


POLLAK FOUNDATION for ECONOMIC RESEARCH 


Newton 58, Massachusetts 

Statisticians have been looking 
for this unusual service 


LIBRARY BUREAU offers a unique service 
to statisticians through its Statistical 
Service Division. 


This division will record and tabulate all 
kinds of statistics for you. Its experienced 
operators, aided by large batteries of calcu- 
lating and tabulating machines, will expedite 
the preparation of any kind of statistics with 
a speed, accuracy and intelligence that will 
amaze you. 


It will deliver regular routine monthly sta- 
tistics on a definite date, or special statistics 
in far less time than you can possibly prepare 
them yourself. 


It will save you the expense of leasing or 
buying machines, hiring and training opera- 
tors and worrying about keeping them busy. 

Statisticians have found that it is an econ- 
omy to employ L. B.’s Statistical Service 
Division. We will be glad to give you full 
information about this service on any kind 
of statistical and research work. 


Statistical Service Division 


Library Bureau 


New York, 380 Broadway 


Boston, 89 Federal St. Detroit, 2210 Park Ave. 
Philadelphia, 910 Chestnut St. Montreal, 27 Board of Trade Bldg. 
Chicago, 214 W. Monroe St. Toronto, 215 Richmond St., West 

RUMFORD PRESS 
CONCORD 
